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Preface 



Strain is the main tool to boost current and enhance performance of advanced 
siUcon-based metal-oxide-semiconductor field-effect transistors (MOSFETs). Mod- 
eling and understanding of strain effects on band structure and mobility has become 
the important task of modern simulation tools used to design ultra-scaled MOS- 
FETs. This book focuses on modern modeling approaches and methods describing 
strain in silicon. Contrary to the valence band, strain-induced conduction band 
modifications have received substantially less attention. Peculiarities of subband 
structures in thin semiconductor films under stress are investigated in detail using 
numerical pseudopotential calculations as well as a k-p theory, which includes 
the two lowest conduction bands. Implementation of strain in transport modeling 
for modern microelectronics design tools is overviewed. Application ranges from 
device modeling to applied mathematics and software development. 

The book is based on my research and partly on my course of lectures given for 
the Master's and PhD students in electrical engineering, microelectronics, physics, 
and applied mathematics at the Institute for Microelectronics, Technische Univer- 
sitat Wien. This book would not have been written without the support of the Insti- 
tute for Microelectronics and its Director Univ.Prof. Dipl.-Ing. Drtechn. E. Langer. 
I would like to thank Univ.Prof. Dipl.-Ing. Drtechn. T. Grasser, Univ.Prof. Dipl.- 
Ing. Dr.techn. H. Kosina, and Univ.Prof. Dipl.-Ing. Dr.techn. Drh.c. S. Selberherr 
for their overwhelming encouragement, support, and help in writing the book. 

I would like to acknowledge the contributions to the book made by my colleagues 
and co-authors: O. Baumgartner, J. Cervenka, S. Dhar, T. Grasser, A. Gehring, 
G. Karlowatz, M. Karner, H. Kosina, K. Likharev, M. Nedjalkov, M. Pourfath, 
F. Schanovsky, S. Selberherr, Z. Stanojevic, M. Vasicek, E. Ungersboeck, and 
T. Windbacher. I also would like to thank H. Ceric, R. Entner, O. Ertl, W. Goes, 
P. Hehenberger, R. Heinzl, S. Holzer, A. Makarov, G. Milovanovich, N. Neo- 
phytou, R. Orio, V. Palankovski, K. Rupp, P. Schwaha, I. Starkov, F. Stimpfl, 
O. Triebl, S. Tyaginov, S. Vitanov, P. Wagner, S. Wagner, J. Weinbub, and W Wess- 
ner for many fruitful and stimulating discussions and C. Haslinger, E. Haslinger, 
M. Katterbauer, and R. Winkler for technical support in preparing the manuscript. 
Special thanks go to O. Baumgartner, M. Nedjalkov, and K. Sitzwohl, who kindly 
agreed to take the heavy duty of proof-reading and improving the manuscript. 
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dation EUROCORES Program FoNE funded by the Austrian Science Fund FWF 
(project I79-N16), CNR, EPSRC and the EC Sixth Framework Program. 
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Chapter 1 
Introduction 



Introduced in mass production at the beginning of the 1970s, the Metal-Oxide- 
Semiconductor Field Effect Transistor (MOSFET) is the key element of modern 
integrated circuits. Although the transistor feature size has shrunk dramatically over 
the past three decades, its overall design stayed nearly the same until recently. Even 
the 90 nm technology node MOSFETs introduced in 2004-2005 and still found in 
nowadays computers are based on the same principle and consist of the same basic 
elements as three decades ago. The inversion channel, which connects the source 
and drain electrodes, is formed at the silicon interface by applying a certain volt- 
age to the gate electrode. The gate electrode made of heavily doped poly-silicon is 
electrically separated from the inversion channel by an oxide layer. A high quality 
silicon dioxide is resilient against an electrical break-through even at high electric 
fields and possesses little defects at the Si/SiOa interface. The good quality of this 
interface guarantees high mobility of the carriers in the inversion channel. Due to 
their perfect compatibility, the pair Si/Si02 has quickly become the main stream 
microelectronic element of Si-based MOSFETs. Low defect density, high yield, and 
a relatively simple and inexpensive fabrication process have put MOSFETs into the 
heart of all modern high density integrated circuits. 

Although the basic design of the transistor did not change, the operation speed 
and performance have increased dramatically. This became possible thanks to the 
scalability of the MOSFETs. Gordon Moore, one of the founders of Intel, has pos- 
tulated the rule known as the Moore's law, according to which the MOSFET size 
reduces exponentially. A new generation of transistors with improved performance 
is introduced every two to three years which allows to double the number of transis- 
tors on integrated circuits every two years, decrease costs per transistor and increase 
performance for the same costs. With the 32 nm technology node presented at the 
International Electron Devices Meeting in December 2008 by Intel, the Moore's law 
did not loose its actuality and MOSFET scaling is successfully continuing. 

Nevertheless, although the scaling is keeping pace with Moore's law, new tech- 
nological solutions for MOSFET design had to be introduced beginning from 
the 90 nm technology node. These crucial changes are addressing growing heat 
generation caused by rapidly increasing leakage currents in scaled devices. 

For a high-speed operation it is indispensable to have high drive current in the 
open, or on state of the transistor. In scaled devices the reduced gate length however 
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2 1 Introduction 

results in a gradual channel control worsening which leads to high source-to-drain 
current in the passive, or off-state, for similar gate voltages. One option to keep the 
ratio between the on- and off-currents sufficiently high for operation is by decreas- 
ing the off-current, which can be done by increasing the gate voltage swing between 
the on- and off-state of the transistor. However, this again leads to high power pro- 
duction and thus is unacceptable. In order to continue scaling under the constrain 
of reduced heat generation the transport properties of the channel in the on-state 
must be improved. Since scattering with defects and surface roughness are already 
optimized, future progress requires a profound modification of the electron band 
structure leading to the increase of the carrier velocity. 

Application of strain allows to increase the on-current significantly without 
changing the transistor design and meeting the projected performance increase. 
Although it has been long known that the electrical properties of silicon strongly 
depend on applied stress, strain as a mobility booster was first introduced in the 
MOSFET fabrication process at the 90 nm technology node. Since then strain 
engineering has become an integral part of the MOSFET fabrication process. 

FinFET and ultra-thin body MOSFET multigate non-conventional structures pos- 
sess superior channel control and reduced leakage as compared to bulk planar 
MOSFETs and are therefore suitable candidates for providing successful scaling to 
the end of the ITRS roadmap. Stress can be easily incorporated in non-conventional 
MOSFETs and is thus completely compatible with the upcoming non-classical 
MOSFET structures. Therefore, strain engineering is expected to keep its pace and 
remain one of the key elements of Complimentary MOS (CMOS) technology at the 
22 nm technology node and beyond. 

Strain is not the only new element introduced recently into CMOS production 
process. In order to guarantee a proper control over the channel in the 65 nm node 
transistor the silicon dioxide layer has become so thin that the gate leakage current 
and related heat generation could no longer be ignored. This prevents future sili- 
con dioxide size reduction, and a new paradigm of scaling under the constraint of 
heat generation must appear. The solution is to replace the native silicon dioxide by 
another oxide with higher dielectric permittivity. This replacement allows to further 
reduce the equivalent electrostatic dielectric thickness thus improving electrostatic 
channel control while keeping the physical oxide dimension thick enough to prevent 
tunneling. At the same time, to reduce the depletion layer in the gate and to partly 
recover the channel mobility the polysilicon gate is replaced by a metal gate. 

Although this step looks natural and simple, the introduction of a new dielec- 
tric and metal gates represents the most revolutionary change in the history of 
semiconductor industry and MOSFET production process since the replacement of 
germanium by silicon. Intel first introduced the new hafnium-based dielectrics with 
metal gates for its 45 nm technology node, and high-k materials with improved prop- 
erties are now used in the 32 nm transistor. Together with new dielectric and metal 
gates, an improved technique to induce more strain into the channel for obtaining 
enhanced performance are employed for the 45 nm and the next 32 nm technology 
node. 
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Manufacturing complexity and production yield increase development cycle time 
and costs. Statistical parameter fluctuations are becoming more pronounced with 
shrinking transistor dimensions causing broader variations in device and circuit 
performance. It is customary to have a tool which allows predicting transistor 
properties thus making design easier. Technology modeling and simulations help 
reducing R&D costs and shorten the design cycle. Therefore, Technology Computer 
Aided Design (TCAD) tools are indispensable for development and optimization of 
upcoming generations of devices and integrated circuits. 

In order to be predictive, TCAD tools must be based on accurate physical models. 
Although piezo-resistive coefficients describe modifications of electrical properties 
of bulk silicon on stress for small strain values, it is not enough to model transport 
in inversion channels, where the corresponding coefficients depend on carrier con- 
centration, doping, channel length, etc. More detailed transport models are therefore 
required to describe current enhancement in inversion layers as well as in FinFETs 
and ultra-thin body FETs. The transport model must include carrier quantization 
in the confined direction. It has to include all appropriate carrier scattering mecha- 
nisms. For strain engineering the models must include stress induced modification of 
the band structure. These modifications have a profound impact on subband quanti- 
zation energies, effective masses, non-parabolicity parameters, wave functions, and 
thus on scattering matrix elements. Although strain engineering is a mature technol- 
ogy to increase CMOS performance, the maximum performance enhancement has 
not been yet analyzed. A careful analysis to determine optimal conditions that lead 
to enhanced transport properties and the current boost is therefore needed. 



Chapter 2 

Scaling, Power Consumption, and Mobility 

Enhancement Techniques 



2.1 Power Scaling 

The power dissipation of a CMOS circuit consists of the dynamic (due to switching) 
and the static contribution in the off-state and can be written as [68] 

P = Y.^'i^i^Dof + JoffVdd, (2.1) 



where < a, < 1 is the "switching activity factor" of the /th circuit block, C, is the 
total effective capacitance including that of all the interconnects and input capac- 
itance of transistors, / is the clock frequency, and I off is the total current in the 
off-state of all the transistors biased by the power supply voltage Vod- In contrast to 
Iqff, the on-current I on = ^, (^OAf)/ participates in (2.1) indirectly, via the speed 
requirement 

/ = p/r, (2.2) 

where 

r = CiVDD/(IoN)i, (2.3) 

and /> <JC 1 is the fraction of the fraction of the clock period 1// taken by the 
capacitance recharging constant r. 

The model of the power consumption described by (2.1)-(2.3) is approximate, 
however it captures the basic balance between the static and dynamic components 
of power generation. 

At the beginning of the CMOS era the power consumption was reduced by scal- 
ing the transistor dimensions and thus the supply voltage Vdd down. However, with 
approaching 100 nm channel size, the Vdd scaling has slowed down. One of the rea- 
sons was a gradual increase of the currents in the qj^^-state. This increase was mostly 
due to parasitic leakages, the most important is due to carrier tunneling through a 
thinner oxide. Indeed, in order to maintain a proper electrostatic control over the 
channel the thickness of the gate dielectric separating the gate from the channel 
must be reduced together with scaling of the gate length, which leads to a sharp 
increase of tunneling through a thin dielectric. With an increase of the Oj^-current 
one option to preserve the high ratio Iqn/ Iqff is to increase the supply voltage 
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6 2 Scaling, Power Consumption, and Mobility Enhancement Techniques 

Vdd- This option is, however, unacceptable, since, according to (2.1), it leads to an 
increase of the power consumption. 

The industry has faced the problem of increase of heat generation already at the 
90 nm technology node. The engineering solution to continue scaling, increase per- 
formance, and keep the heat generation under control was the introduction of strain 
into the channel [20]. Strain modifies the transport properties of the transistor in 
the open state, while keeping them practically unchanged in the off-state. If the Iqn 
current is increased by applying stress, it leads, according to (2.2), to higher speed 
and performance. Therefore, if a higher Iqn is achieved for the same I off and Vdd, 
the performance gain is accomplished at nearly no increase of the power generation. 
Alternatively, the performance similar to an unstrained device is achieved at lower 
Vdd and thus reduced power consumption. 

Although a new technology of high-k dielectric/metal gate, which allows reduc- 
ing loFF (and thus power consumption) while preserving the good control over the 
channel, was introduced at 45 nm technology node [43], stress technique remains 
one of the main boosters of performance enhancement with scaling. In the 32 nm 
technology node introduced by Intel at the end of 2008, the fourth generation of 
advanced channel stressors is employed [44] allowing to get tensions of 1 .2-1 .5 GPa 
in the channel. In 2009 nearly 2 GPa stress in the channel was achieved [50]. 

The on-current boost by stress is due to the strain-induced mobility enhance- 
ment in the channel. Depending on the stress conditions, up to the fourfold mobility 
enhancement for holes and nearly twofold for electrons was reported [71] and up to 
50% increase in transistor drive current [22, 43, 62, 76, 80] was documented. The 
mobility enhancement is predicted up to the stress level of at least 3 GPa [71], which 
is higher than the level currently delivered into the channel. It makes stress a viable, 
competitive, and important technology which will certainly be used to boost the 
performance of future technology generations beyond the 32 nm technology node 
currently in production. 

In this chapter we will briefly review the history of stress in silicon and the main 
techniques to introduce stress currently utilized in laboratories and industry. Stress 
is not the only option to enhance mobility in the channel. As shown in Fig. 2.1, sub- 
strates different from the commonly used (001) wafers may also be used to obtain 
higher mobility. This hybrid orientation technology becomes important with the 
introduction of Fin-FET devices with the [110] channel direction, where the two fin 
interfaces are (110) oriented. Devices with channel directions different from [110] 
can also be considered. Finally, alternative channel materials with mobilities higher 
than silicon mobility, e.g., germanium or III-V semiconductors can be used. 



2.2 Strain Engineering 

Strain engineering technologies are based on enhancing the transport properties 
by mechanically stressing the silicon channel of a MOSFET. The advantage of 
these techniques is that they allow to get higher performance without changing the 
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Fig. 2.1 Classification of stress techniques. Mechanical stress is used in laboratories 



MOSFET size and architecture dramatically. Several techniques to deliver strain 
which require only little change in the process flow have been developed. This 
allows to integrate strained silicon into the manufacturing process at low additional 
production costs. 

The influence of strain on transport in semiconductors has been a research topic 
for over half a century. Already in the beginning of the 1950s it was discovered 
that stress may influence the intrinsic silicon mobility [23,63]. To explain the effect, 
Herring and Vogt [26] have generalized the deformation potential theory initially 
proposed by Bardeen and Shockley [7] to describe the coupling between electrons 
and acoustic waves in solids and to express the relaxation times via the effec- 
tive mass and deformation potentials. They have shown that the electron mobility 
change is due to repopulation between the valleys and reduced inter-valley scatter- 
ing. Both effects are caused by stress-induced energy shifts which, depending on 
the stress condition, lead to the lifting of degeneracy of the six equivalent valleys. 
This interpretation of the mobility enhancement is often used to explain the mobility 
enhancement due to uniaxial stress as well, although, as we will show below, it is 
valid only for uniaxial stress in [001] direction, or, equivalently, for a biaxially, or 
inplane stressed sample. The effective mass change appears in [110] stressed sam- 
ples, as was first demonstrated by Hensel [25] 1965 but since then well forgotten. 
Only recently [73] the Hensel-Hasegawa-Nakayma model of the conduction band 
was used to model the mobility enhancement in uniaxially stressed MOSFETs with 
technologically relevant [110] channel direction [72]. 

The stress-induced valence band shifts and warping are essential to understand 
the hole mobility modification. The k-p-based model [40] with a Hamiltonian 
including strain [8] has been a reliable and inexpensive method to address the 
stress-induced valence band modification since 1963 [24], which is successfully 
used nowadays to describe the subband structure in inversion layers [66]. 

The transport properties of strained silicon can be reasonably well predicted by 
piezoresistance coefficients for small stress values. However, the value of piezore- 
sistances depends on the parameters like doping level or temperature and should be 
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measured for each sample. Another problem is that the bulk values of the piezoresis- 
tances may not be used to predict the behavior of MOSFETs with confined carriers 
in the surface layer where the piezoresistance depends on the effective field as well. 

Until the beginning of 1990 stressed silicon was studied by the physics commu- 
nity, but remained relatively unexplored for engineering applications [18]. In the 
pioneering work by Welser in 1992 it was demonstrated that an n-MOSFET with a 
channel built out of biaxially stressed silicon possesses nearly a 70% higher mobil- 
ity [77]. In 1993 an increase of hole mobility in a p-MOSFET was reported [45,46]. 
The biaxial stress in silicon was achieved by growing the silicon layer on SiGe 
substrate. The drive current enhancement in pMOSFETs as a function of germa- 
nium concentration was investigated in [56], while short-channel n-MOSFETs were 
studied in [55]. History and the current status of the technology based on biaxially 
strained silicon, SiGe, and germanium channel MOSFETs is discussed in detail in a 
recent review [38]. 

By now the industry has adopted several technologies to introduce strain in the 
Si channel of MOSFETs. The key challenge is to make the technology compatible 
with the CMOS manufacturing process flow. For uniaxial stress the integration was 
successfully achieved [9, 20, 35, 64]. This is why uniaxial stress first introduced in 
[19, 33, 61] is currently employed by the silicon industry. Uniaxial stress results in a 
smaller threshold voltage shift [69] and higher mobility enhancement [66]. Modern 
stress techniques are compatible with the multi-gate architectures [30-32, 67] and 
were recently integrated with high-k dielectrics and metal gates [15,79]. 

Although many strain technologies were developed and introduced up to now, 
they can be conveniently divided into two distinct categories: global techniques 
where stress is introduced into the whole wafer, and local techniques, where stress 
is delivered to each transistor separately and independently (Fig. 2.1). Local stress 
is usually introduced during the process of MOSFET fabrication and is sometimes 
called process-induced stress. 

Stress must be beneficial for the transport boost in both n- and p-type channels. It 
turns out that to get the performance improvement n-MOSFETs should be stretched, 
while p-MOSFETs must be compressed. Obviously, the global stress technique 
cannot provide the current improvement for both n- and p-MOSFETs. Therefore, 
industry uses local stress techniques, although biaxially stressed Si can also be used 
to increase mobility of n-type transistors [16]. We begin with biaxially stressed Si 
on SiGe technology. 



2.3 Global Strain Techniques and Substrate Engineering 

High quality silicon wafers are the primary elements used in chip manufactur- 
ing. Due to growing needs for channels with improved transport properties and 
rapidly increasing expertise in synthesizing new materials with enhanced electri- 
cal, mechanical, or chemical characteristics several ways to engineer silicon wafers 
were recently explored. This results in a substrate with unique properties which 
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cannot be achieved by using silicon alone. At the beginning of 1990 the system of a 
silicon layer grown on a thick SiGe virtual substrate attracted attention to enhanced 
mobility in strained silicon [77,78]. The lattice constant of relaxed SiGe is slightly 
larger than the one in relaxed silicon. Thus, a thin silicon film grown epitaxially 
on top of a SiGe substrate becomes tensely strained due to the lattice mismatch 
between silicon and SiGe. Because of the lattice symmetry of silicon a (001) silicon 
film is equally elongated along [100] and [010] axes which results in biaxial strain. 
This type of strain is introduced globally through the whole wafer. Biaxial strain 
results in the conduction band modification which finally leads to improved elec- 
tron transport. The drive current is increased by up to 25% in sub-lOOnm strained 
silicon MOSFETs [54]. Global stress techniques are not restricted to standard bulk 
CMOS technology. Thanks to layer transfer and wafer bonding global stress is suc- 
cessfully integrated into SOI wafers. Recently, the performance enhancement in a 
60 nm gate length n-MOSFET with an ultra-thin strained silicon layer grown on a 
SiGe substrate on insulator was demonstrated [21,58]. 

Current enhancement alone is not sufficient for a technology to go into mass pro- 
duction. The new technology must be economically competitive [57] and deliver 
benefits exceeding production costs. Regardless of the proven electron current 
enhancement in biaxially strained silicon, the presence of the SiGe layer in a sub- 
strate introduces several challenges for process integration. One problem is that the 
SiGe layer induces a high density of defects in strained silicon [18]. The diffusion of 
Ge atoms into the strained silicon film reduces the thermal budget window. Due to 
the lower thermal conductivity of SiGe device self-heating may become a problem, 
especially in the SiGe on insulator structures. Finally, the diffusion rate of dopant 
atoms (boron, arsenic) is significantly different from that of silicon [74]. 

Several alternative approaches to introduce biaxial strain in silicon without SiGe 
layer were proposed. In the "strained silicon directly on insulator" technology the 
SiGe layer is eliminated before transistor fabrication. This technology delivers a 
25% drive current enhancement while avoiding the difficulties of SiGe process 
integration. 

Another back-end technique introduces strain into an already processed wafer. In 
this approach the wafer is mechanically stressed, after it was thinned down and put 
onto a polymer film. After that the wafer can safely be bonded to a final substrate. 
The advantage of the method is that it allows to introduce uniaxial as well as biaxial 
strain according to the mechanical deformation, and a 100% performance enhance- 
ment has been demonstrated [ 1 , 53], however, yield and reliability issues have so far 
prevented the technique from being used in IC manufacturing. 

As it was already pointed out, global stress techniques are able to provide only 
one type of strain through the whole wafer. However, n- and p-type channels are 
affected by strain alternatively: an in-plane biaxial tensile strain is beneficial for 
n-MOS but detrimental for a p-MOS, and vice versa. We briefly review local strain 
techniques delivering a particular stress to each MOSFET. 
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2.4 Local Stress Techniques 

Already in the 1990s it was found that certain process steps and IC elements 
appearing during wafer processing result in channel stressing and thus performance 
increase. Shallow trench isolation [41, 59], silicidation at the source and drain 
region [65], and formation of nitride contact-etch-stop layer [33, 61] were among 
earlier local stress techniques investigated (Fig. 2.2). Although the process induced 
stresses were moderate and could not provide sufficient drive current boost at the 
earlier stage, local techniques have certain advantages over global ones. Process- 
induced stress can be independently delivered to p- and n-MOSFETs guaranteeing 
the performance enhancement in both types of transistors. Additionally, stress can 
be introduced along three coordinate axes. This allows to optimize performance 
enhancement and costs, reduce the threshold voltage shifts [39], and improve inte- 
gration into the process flow [36]. Importantly, the interest in stress technology 
was supported and motivated by industry needs to optimize the ratio of the per- 
formance to heat generation for the upcoming 90 nm technology node. Several 
process-induced local stress techniques, such as stressed nitride contact etch stop 
liner, stress memorization technique, selective epitaxial growth for embedded SiGe 
in the source and drain contacts, and stress from shallow trench isolation were 
introduced in mass production of integrated circuits. 

In modern sub-lOOnm technologies the transistor dimensions are so small that 
the mechanical stress induced by shallow trench isolation becomes important [9,75]. 
Stress can be induced both parallel and orthogonal to the channel lateral directions. 

Another way to introduce compressive uniaxial stress into a p-channel is by fill- 
ing the source and drain regions with SiGe [6, 17,27,49,70,85]. For this purpose, the 
source and drain regions are etched out and a recess area is created. This recess is 
later filled by SiGe grown epitaxially in the source and drain regions [6,49]. Alter- 
natively, SiGe can also be grown on top of source and drain [12]. Depending on the 
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Fig. 2.2 Process-induced stressors employed by the semiconductor industry. Shallow-trench isola- 
tion, highly compressive and tensile capping layers, and compressive stress due to SiGe embedded 
in the source and drain regions are used in the CMOS process 
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thickness of the epitaxial Sii^xGejc and the Ge content x large uniaxial stress can 
be created using this method. 

A part of the mechanical stress from a permanently stressed layer grown on top 
of a transistor can be transferred into the channel. The value of stress transferred 
depends on the thickness and the material properties of the liner [33]. To boost per- 
formance of an n-MOSFET a tensile cup layer is needed, while for a p-MOSFET 
the compressive layer is required. Thus, two different types of stress liners should 
be used to get performance enhancement in n-channel and p-channel MOSFETs 
simultaneously. Industry adopted a Dual Stress Liner (DSL) process, where a highly 
compressive nitride is deposited on top of the p-channel MOSFET, while a highly 
tensile nitride is deposited on top of the n-channel MOSFET. Silicon nitride (Si3N4) 
capping layers can produce both tensile and compressive strain depending on depo- 
sition conditions. In the fabrication process, a tensile silicon nitride layer is created 
by thermal chemical vapor deposition over the whole wafer. Parts of the layer are 
removed above p-MOSFETs by selective etching. After that a compressive Si3N4 
layer is created by plasma-assisted chemical vapor deposition, followed by selective 
etching of the compressive layer above n-MOSFETs. Dual stress liners technol- 
ogy alone can improve the drive current by 11% in n-MOSFETs and by 20% in 
p-MOSFETs [60,81]. 

Si3N4 layers with more than 2.0 GPa tensile and 2.5 GPa compressive stress 
which introduce approximately 1.0 GPa stress in the MOSFET channel are routinely 
used in 65 nm process [4]. This technique is successfully combined with selective 
epitaxial growth for embedded SiGe in the source and drain contacts [43]. Thus, 
strain engineering techniques may not only be combined for the same transistor, but 
can be superimposed to yield even larger performance boost [27]. 

Residual channel stress may by preserved after removal of the nitride layer. This 
fact is exploited in the stress memorization technique [ 1 1 , 27, 36, 48] . In a process 
using this technique, the conventional dopant activation spike anneal is performed 
after the deposition of a tensile stressor capping layer. This layer is subsequently 
removed before an eventual salicidation process. Even though the stressor nitride 
layer is removed from the final structure, the stress has been transferred from the 
nitride film to the channel during annealing and memorized by the re-crystallization 
of source, drain and the poly gate amorphized layers. Stress from the capping layer 
can be memorized in the channel. Stress is preserved in the channel even after the 
stressed layer is removed from the final structure providing a 15% improvement of 
the drive current in n-channel MOSFETs [10]. 

Process-induced local stress techniques depend strongly on device geometry and 
must be adjusted and optimized to maximize beneficial effects from stressors [17]. 
However, regardless of the challenges of local stressors integration into the manu- 
facturing flow, local stress techniques have proven useful for industrial applications 
and promising for future technology nodes. 
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2.5 Advanced Stress Techniques 

Stress was introduced into the fabrication process flow at the 90 nm technology 
node. Since then stress is a compulsory technique to get the MOSFET perfor- 
mance enhanced included in all technology nodes. Stress techniques were constantly 
improved and perfected through the 65 nm and 45 nm technology nodes in order 
to transfer more strain into the channel. The germanium concentration in the 
source/drain regions of p-MOSFETs was constantly increased from 17% at the 
90 nm technology node to 23% at the 65 nm which resulted in a 60% increase of 
the channel strain. At the same time an enhanced process flow adopted for the 
Si3Ni4 capping layers increase the channel strain in n-MOSFETs by 80% [6]. Strain 
techniques are compatible with high-k dielectrics/metal gate technology and were 
successfully integrated in the process flow at the 45 nm technology node, resulting 
in the third generation strained silicon [43]. 

At the International Electron Devices Meeting in 2008 Intel has reported its 
second generation of high-k dielectrics/metal gate 32 nm transistors. The fourth 
generation of stress technology allowed to get approximately 14% in performance 
improvement [44] as compared to the 45 nm transistors. The technique allowed to 
build the largest SRAM with more than 1.9 billions transistors. Multiple stressors 
are combined to produce even higher strain in the channel. The fourth generation 
stress technology includes improved stress liners for both n- and p-MOSFETs. Com- 
pared to the 45 nm technology node where the dual stress liners with 1 .5 GPa tensile 
and 2.8 GPa of compressive stress were used [43], capping layers with more than 
2 GPa tensile and 3.5 GPa compressive stress are introduced for the 32 nm node. In 
combination with SiGe source/drain regions with high (approximately 30%) ger- 
manium concentration uniaxial stress of approximately 1 .5 GPa is produced in the 
channel. The replacement metal gate, or gate last, process when the poly-silicon 
gate of a transistor is removed and later substituted by a metal gate allows to pro- 
duce even more uniaxial compressive stress [5,44], as demonstrated in Fig. 2.3. 
This allows to obtain the best drive currents of 1.55mA/|xm for n-MOSFETs and 
1.21mA/|i,m for p-MOSFETs reported for 32 nm technology node at the end of 
2008 [44]. 





Fig. 2.3 Illustration of additional tensile strain introduced in the gate-last process [5, 44] 
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Tensile stress can also be generated in the n-channel MOSFET by using Sii-xCjc 
stressors with a small mole fraction x [2]. It was demonstrated that for n-channel 
MOSFETs the implementation of the SiC source/drain regions provides signif- 
icant drive current enhancement of up to 50% at a gate length of 50 nm [14]. 
Thus, Sio.99Co.01 induces as much of tensile stress as Sio.75Geo.25 - compressive 
stress [3]. This method was not incorporated yet into mass production and possesses 
a large potential to induce tensile stress for technology nodes beyond 32 nm. 

For technology nodes beyond 32 nm the gate becomes less than 30 nm, and 
improved channel control by the gate is required. Although alternative channel 
materials with improved transport properties may be the key to extend the planar 
MOSFETs down to 22 nm and even to 16 nm technology nodes, multi-gate FinFETs 
and ultra-thin body SOI based technologies provide a better channel control and 
are thus considered as viable candidates for the next generation technology nodes. 
An integration of stress into an SOI CMOS fabrication process was demonstrated 
by [27]. Like in bulk devices, an embedded SiGe process and a compressively 
stressed liner film are used to introduce compressive strain in the p-MOSFET, 
whereas a stress memorization and a tensile stressed liner are inducing tensile strain 
in the n-channel MOSFET. An optimization of the process-induced stresses yields 
an improvement in saturation drive current of 53% for p-channel and 32% for 
n-channel MOSFETs, respectively. 

Although the process of introducing global stress by growing silicon on relaxed 
SiGe substrate did not receive appreciation by industry, it has prompted the intro- 
duction of a technique called by IBM a reverse-embedded SiGe approach [16]. This 
technique employs a hurried SiGe layer to induce tensile stress in n-channel SOI 
MOSFETs. In the reverse-embedded SiGe approach the n-FET is fabricated at SiGe 
source/drain areas of p-MOSFETs following silicon re-growth. SiGe, which is typ- 
ically used to generate compressive stress in p-channels, is very efficient to impart 
tensile stress of approximately 400 MPa in the n-channel, leading to 15% increase 
of the drive current (Fig. 2.4). 



Fig. 2.4 Illustration of the 
reverse-embedded SiGe 
approach [16], when an 
n-FET with strained silicon 
(SSi) is fabricated at the SiGe 
source/drain areas of 
p-MOSFETs following 
silicon re-growth 
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With the device dimensions further reduced, the volume of SiGe source/drain 
regions shrinks as well, thus degrading its stressing capabilities. Although strain 
remains an efficient performance booster for the 22 nm and even for 16nm tech- 
nology nodes, it remains to be seen if stress is an efficient performance booster 
for short-channel devices beyond 16 nm. Thus, other techniques including hybrid 
orientation technology and alternative channel materials become important. 



2.6 Hybrid Orientation Technology and Alternative 
Channel Materials 

Carrier mobility in silicon surface layers depends strongly on the substrate crys- 
tal orientation and the direction of the current flow. For instance, hole mobil- 
ity increases if one selects the [100] instead the [110] transport direction for a 
standard (001) wafer [37,41, 84], a fact which was already employed in the 90 nm 
technology [34]. 

Electron mobility in the silicon channel is known to be the highest for the tradi- 
tional substrate (001)/[110] channel direction configuration, while hole mobility is 
maximal in (110) substrate for [110] channel direction, which is 2.5 times higher 
that the mobility for the traditional (001) substrate and [110] channel direction. 
Thus, by adjusting the crystal orientation, channel direction (Fig. 2.5), and strain 
one can optimize the transport properties for n- and p-MOSFETs [81, 82]. 

In case of a [110] oriented FinFET the two faces are (001) oriented, while the 
other two are of (110) crystal orientation. Therefore, the relative contribution of 
(001) and (1 10) surfaces into transport depends on the aspect ratio of the fin width 
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Fig. 2.5 Hybrid orientation technology uses different substrate crystal orientations and/or channel 
directions. The traditional transistor configuration on the (001) substrate with the [110] channel 
direction is shown 
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to the height. This additional option to choose the transport interface and thus to 
control the FinFET by performance makes them attractive for the technology nodes 
beyond 22 nm. 

Benefits of applying stress to (110) p-MOSFETs are under intensive investiga- 
tion. Recent computational studies on strain-induced low field mobility enhance- 
ment demonstrated that, although (110)/[110] hole mobility is superior over the 
(001)/[1 10] mobility in relaxed silicon, they become approximately equal in a highly 
stressed p-channel [66] . These results are supported by a comparative study between 
Si (110) and (100) substrates on mobility and velocity enhancements for short- 
channel highly- strained p-MOSFETs [42], where it is demonstrated that mobility 
and velocity enhancements under high channel stress for (100) substrate are larger 
than those for (110). Thus, saturation current on (100) is similar or slightly higher 
than that on (110) for p-MOSFETs with higher channel stress, and benefits of the 
hybrid orientation technique combined with stress are becoming less obvious. 

In order to extend the planar technology to the 16 nm node and beyond one has to 
use alternative materials for the channel. The use of III-V semiconductors with high 
intrinsic mobility integrated on silicon substrate is one possible solution explored 
recently. Intel has presented InGaAs quantum well device structure on silicon [29]. 
No mobility degradation in quantum well grown on silicon as compared to the well 
grown on III-V substrates was observed. Compared to n-MOSFETs, the proposed 
quantum well FET on silicon operates at low supply voltage of 0.5 V, exhibits more 
than ten times DC power reduction for the same speed performance, and expe- 
riences twofold performance gain for the same power. Recently a high mobility 
Ill-V-on-insulator FET on silicon substrate with metal source/drain contacts using 
direct wafer bonding was demonstrated [83]. Lateral tensile strain introduced in 
Ino,53Gao,47As n-MOSFETs with in-situ doped lattice-mismatched source and drain 
stressors and interface engineering is shown to boost the performance of III-V 
n-FETs [83]. Compatibility of stress techniques with the IIl-V FETs makes them 
very attractive for future ultra-scaled high-speed applications. 

For high-speed p-MOSFETs germanium channels are considered. The ultimate 
advantage of germanium as compared to silicon are two times higher electron bulk 
mobility and, most importantly, the fourfold increased hole mobility. However, 
despite the intrinsic speed advantages, germanium does not grow a stable oxide. 
Thus a manufacturing method similar to the traditional one employed for silicon 
MOSFETs is not applicable. Although high-k dielectrics have been implemented 
with germanium channel transistors [13], satisfactory quality of the gate stacks has 
not yet been achieved. This results in a mobility enhancement smaller than antic- 
ipated. Finally, germanium application to n-channel devices is prohibited by the 
low drive current of Ge-based n-MOSFET. The poor electron transport property in 
Ge-based n-channels is primarily due to the intrinsically low density of state and 
high conductivity effective masses [47]. 

At the International Electron Devices Meeting 2008 Intel has presented the 
first high-speed low power III-V p-channel quantum well FET [51, 52]. The com- 
pressively strained InSb quantum well device possesses hole mobility as high as 
1230cm^/Vs. The highest cut-off frequency of 140 GHz for III-V p-channel FETs 
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was deinonstrated at a supply voltage of 0.5 eV. As compared to a p-channel 
silicon MOSFET, the quantuin well FET produces ten times less heat at the same 
speed and develops two-times higher speed for the same power. Therefore, both 
n- and p-type IIl-V quantum well FETs are perfect candidates for future high speed 
and low power logic applications. However, before they become applicable to the 
silicon industry, many technical challenges must be overcome for Ill-V compound 
semiconductors to be integrated onto large silicon wafers [28]. 
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Chapter 3 
Strain and Stress 



3.1 Strain Definition 

All parts of a relaxed solid are in mechanical equilibrium with each other. Let 
us characterize a point within the solid by a radius vector r with the coordinates 
(x,y,z) in a Cartesian coordinate system. Under application of external forces the 
solid gets deformed: it changes its form and the volume. Due to deformation the 
point r moves to another point r' = (x' , y',z!)- The difference r — r' is called 
displacement u. The displacement 

u(r) = r'(r)-r (3.1) 

defined as the function of a coordinate r characterizes the deformation of a solid 
quantitatively. 

The dependence of the displacement u(r) on the position r is the reason of mod- 
ification of the relative distances between the two points in a solid. The distance 
between the two points AL' = ^ Ax''^ + Ay'''- + AtI^ in a deformed solid can be 
expressed via the distance AL = ^J{Ax)^ + {Ayy- + {Azf- in the relaxed solid 
and the displacement u as 



/ ^ / 9m,- du i duk dujc \ 
AL' = AL^ +i^ + ^ + ^^\Ax,Axj, (3.2) 

where summation over repeating indices is assumed {i, j = x,y, z). The expression 
(3.2) can be rewritten as: 

' Ar~, (3.3) 



A 


L' = 


y/AL'^ + Isik 


the strain tensor 
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" 2 


( dui du j 

1^ + 1^ + 
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dxj dxj 



(3.4) 
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24 3 Strain and Stress 

is introduced. By the definition (3.4) the strain tensor is symmetric: 

Sij = Sji. (3.5) 

Therefore, the strain tensor (3.4) can be diagonalized by an appropriate transforma- 
tion of the coordinate system. The coordinate system in which the strain tensor is 
diagonal depends on the position r. In the system where the strain tensor is diagonal 
the modification of the distance between the two points AL' is expressed as: 



AL' = ^(1 + 2eff)ZiL| + (1 + 2s^^)ALl + (1 + 2ef^)Z\L2. (3.6) 

Thus, an arbitrary deformation of a small volume around the point r can be repre- 
sented as a combination of three simple independent deformations along the three 
orthogonal axes diagonalyzing the strain tensor at this point. A relative change of 
the length along an axis a = ^, v, ^ is 

ALoc - AL' I 

^- 5L = ^i+2e„„-l. (3.7) 

ALqi 

In case of small displacements, which is a typical situation of up to a few percents 
deformation, the relative change in the length La is proportional to 8aa '■ 

ALa - AL' 

^T7 = Saa. (3.8) 

ALa 

The strain tensor in this case simplifies to 

1 / 9m,- du i \ 

''^ = 2 (a^ + a^ j ■ ''■'' 

According to (3.8), the volume change (AV — AV')/AV is proportional to the 
sum of Saa- Since the trace of a tensor is invariant under a coordinate transformation, 
the relative volume change can be written as: 

AV- AV 

—r^ = EXX + Syy + S^,. (3.10) 

We mention other quantities used in literature to describe strain. Frequently, the 
engineering strain tensor e,y is used. It is defined as: 



^xx ^xy ^xz 

^xy ^yy ^yz I — I -^^xy ^yy -^^yz I- W-^IJ 
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Here, sometimes the off-diagonal terms of the engineering strains are denoted as 
Yij = eij = Isij. 

Due to the symmetry e,/ = Sy,- the strain tensor is characterized by six indepen- 
dent components, which may be arranged into a vector: 

(sxx,£yy,£zz^2£y,,2sxz,2sxy) = (ei , 62, 63. €4, 65, ee) ■ (3.12) 

The six components notation is convenient, when writing the relation between strain 
and stress in cubic semiconductors. 



3.2 Stress 

In a deformed solid the internal structure is changed. Due to deformation the atoms 
are shifted from their equilibrium positions. This leads to the appearance of the 
internal forces trying to bring the atoms back to their equilibrium. In the absence 
of piezo-electric effects the forces of interatomic interaction are short-ranged. In a 
situation when we are interested in deformations of a volume which includes many 
atoms, the interaction radius can be set to zero. Thus, the internal forces due to 
deformation are local. This means that these forces can act on a certain part of a 
deformed solid only through the surface surrounding this part of a solid. 
The total force F acting on the part of a solid can be written as an integral: 



-/ 



fdV, (3.13) 



where f is a force acting on an elementary volume of the selected part. Since the 
interactions between the elementary volumes inside the part of a solid can not pro- 
duce the force F acting on the part, the total force F is due to the action of the 
external surrounding environment on the selected part. Due to the short-range nature 
of interaction forces, this action is local and is applied to the surface surrounding 
the selected part. Thus, the component i = x,y,z of the total force F must be equal 
to an integral over the surface of a vector c, : 

Fj = I ajdS = V" / OijdSj, i,j = x,y,z, (3.14) 

j 

where JS is a vector orthogonal to the surface element and pointed in the outer 
direction with an absolute value equal to the area of the surface element. The second- 
rank tensor a,y is called the stress tensor. 

The definition (3.14) of the stress tensor allows a simple interpretation. Let us 
choose a small cube as the selected volume, with its faces orthogonal to the axes of 
the coordinate system. Let us consider a single face of the cube, say dSx oriented 
along the positive direction of the axis x (the outer normal vector is in the positive 
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Fig. 3.1 Illustration of the 
force d¥ acting on a cube 
face dS used in defining the 
stress tensor (3.15) 




direction of the x axis). The force dF acting on the face dSx is not necessarily par- 
allel to the X axis as shown in Fig. 3.1. This force is characterized by the component 
parallel to the face plane, the shear components, and the component orthogonal to 
the plane, the normal component. Then the components of the stress tensor dix at 
the point surrounded by the selected cube are defined as the limit 



,. dFi 

Cix = lim -— 
ds^o dS 



(3.15) 



Other components are defined by analogy. The off-diagonal components are due 
to the projection of the force dF on the face and lead to a shear distortion of the 
selected cube. The off-diagonal stress components are also called shear. 
From the GauB theorem, (3.13) and (3.14) it follows that 



and 



= //.v = / 



fi = divff, 



divajdV, 



(3.16) 



The definition of the shear tensor ct,/ is not unique [4]. This uncertainty is removed 
by making the stress tensor symmetric: 

a, J = Oji. (3.17) 



This symmetry is required by the condition that the total moment with respect to an 
arbitrary point in equilibrium is zero. 
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3.3 Relation Between Strain and Stress Tensor 
in Silicon and Germanium 

External forces applied to a solid result in deformation and internal stress. Depend- 
ing on the amplitude of the force applied to a relaxed solid it can either return into 
an initial relaxed state or, alternatively, it can preserve a certain level of deforma- 
tion even when the external force that caused the deformation is removed. In the 
last case of large deformation the atoms are moved so far away from their equi- 
librium position that the solid finds another local energy minimum. The potential 
barrier separating the local minimum from the global one preserves the solid to 
return into relaxed state. In case of relatively small deformation the structure of the 
solid is not changed dramatically, atoms are shifted only little from their equilib- 
rium positions. In this case of elastic deformation there is a unique relation between 
deformation and internal strain. The well-known elementary text-book example is a 
deformed spring where the relation between the applied force F and deformation u 
is described by Hooke's law F = ku, where k is the spring constant. 

A generalization of Hooke's law describing the relation between the stress and 
strain tensors in a three-dimensional elastically deformed solid was suggested by 
Cauchy 

^ij = CijuBkl- (3.18) 

Here Ciju is the elastic stiffness tensor of the fourth order. Number of independent 
entries depends on the symmetry of the solid [3]. Silicon and germanium belong to 
the cubic semiconductors. In this case the stiffness tensor is characterized by only 
three independent components conveniently written as ci i , c\2, and C44. The values 
of the stiffness constants of silicon and germanium [5] are summarized in Table 3.1. 
The stiffness tensor allows to find the stress tensor if strain is known: 



(cixx\ 

Oyy 



Ylxyj 



(cn C12 C12 \ 

C12 cii C12 

C12 C12 cii 

C44 

C44 

V C44/ 



( Sxx\ 

Syy 

^^yz 
2£xz 

V^xy) 



(3.19) 



Alternatively, the relation between strain and stress is given by the compliance 
tensor Sjju 

Sij = SijkiOki, (3.20) 



Table 3.1 Elastic stiffness 
constants of Si and Ge [5] 



Silicon Germanium Units 



Cll 

C12 
C44 



166.0 
64.0 
79.6 



126.0 
44.0 
67.7 



GPa 
GPa 
GPa 
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or in matrix form 



Syy 

ISy, 

2exz 



All *12 512 \ 
^12 *11 ^12 
^12 ^12 ^11 

544 

544 

\ 544/ 



(cixx\ 

Oyy 

Ovz 



Y'xyJ 



(3.21) 



Taking into account that the compliance tensor is an inverse for the stiffness tensor, 
the compliance constants sij are expressed via the stiffness constants c,y as 

Cll + C12 



■^11 - 2 , 

Cfi +C11C12 

-C12 


-2c?2 


■^'^ 2 , 

Cll +C11C12 

1 


-2c?2 



544 



and 



C44 



3.4 Strain and Stress Tensors: Examples 
3.4.1 Uniform All-Around Compression 

The components of the stress tensor can be easily determined in the case of uniform 
compression. Forces acting on any face of a selected small cubical volume are equal. 
The force (iF acting on a face dSx is equal to JF = (—pdSx,0, 0). According to 
(3.15), the stress tensor possesses only diagonal elements and can be written as: 




(3.22) 



Strain and stress in a solid are completely determined by the elements of the cor- 
responding tensors in a given coordinate system. It is convenient to choose the 
coordinate system in cubic semiconductors so that the axes are orthogonal to the 
cube faces. This system is called crystallographic coordinate system. Stress and 
strain tensors will be always given this system. 

In order to characterize directions and plane orientations in cubic crystals the 
notation of Miller indices is used [ 1 , 3] . A triplet of integers [nmk] defines the coor- 
dinates of the radius vector along the direction in the crystallographic coordinate 
system, while {nmk) is used to describe all the directions equivalent by symmetry 
to [nmk]. Similarly, to determine orientation of a plane, the triplet of integers (nmk) 
is used to define the coordinates of the vector normal to the plane, while {nmk} 
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(100) 



(110) 



(111) 



Fig. 3.2 Three most frequently used surface orientations characterized by the Miller indices (100), 
(110), and (ill) 



describes all the planes equivalent by symmetry to the (nmk) one. Negative indices 
are indicated with a bar above the number. Orientations of the most commonly used 
surfaces (100), (110), and (111) are shown in Fig. 3.2. 



3.4.2 Biaxial Strain Resulting From Epitaxial Growth 

In a global stress technique a thin silicon layer is grown epitaxially on a top of 
relaxed virtual SiGe substrate. Due to the lattice constant mismatch between silicon 
and SiGe the silicon layer is biaxially stretched. The strain tensor in the crystallo- 
graphic coordinate system for an arbitrary substrate orientation can be evaluated [2] 
by first determining the strain tensor a,^^ in the interface coordinate system and then 
transforming it by an appropriate coordinate transformation. The in-plane strain is 
described by the difference between the lattice constants Uy and as in relaxed [5] 
and strained silicon, respectively: 



(3.23) 



Thus, the in-plane components of a^^ are equal: 



(3.24) 



During the epitaxial growth shear distortions usually do not appear, which allows to 
set all the off-diagonal components to zero: 



s'^j = 0, 



i i^ }■ 



(3.25) 



The only undetermined component of the strain tensor £33 is found from the condi- 
tion that the stress component in the direction of growth is set to zero: a'^-^ = 0. By 
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using the generalized Hooke's law a,y = CL^e'j^^ in the interface coordinate system, 

n: 

"^3311 + "^3322 „ 



the unknown £33 is written in the form: 



=33 



(3.26) 



-3333 



As soon as the strain tensor in the surface coordinate system is determined, it can 
be transformed to the crystallographic coordinate system by a unitary transforma- 
tion U 

s=Us'U^, (3.27) 



where U is the rotation matrix. (3.27) can be written in its unfolded form: 

Ski = Uiks'ijUij, 



(3.28) 



where the summation over repeating indices is assumed. 

The direction of the z' axis of the surface system is described by a polar angle (p 
and azimuthal angle 9 in the crystallographic coordinate system. Thus, the surface 
coordinate system can be obtained from the crystallographic system by rotating it 
first around the z axis by a polar angle (j) followed by a rotation around the new y' 
axis by an azimuthal angle 9 shown in Fig. 3.3. The transformation matrix U can 
be written as: 



U = {R,{cP)Ry(e)) 



(3.29) 




Fig. 3.3 Transformation of the crystallographic coordinate system to the interface system is 
described by a counterclockwise rotation around the z axis followed by a rotation around y' axis 



3.4 Strain and Stress Tensors: Examples 
where 



cos{(p) sin(0) Oi 
Rzi4>)= |-sin(./.)cos(</.)0 
1, 



Ry'iO) 



fcosiO) - sin(^)^ 

1 
^sin(0) cos(9) / 



Therefore, 



U 



fcos{d) cos{(p) — sin(0) sin(0) cos(^)^ 

cos{6) sin(0) cos(0) sm(9) sin(0) 

- sin(0) cos(6l) 
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(3.30) 



(3.31) 



(3.32) 



Using the transformation U, the stiffness tensor in the interface coordinate 
system needed in (3.26) can be expressed via the one in the crystallographic 
system as: 



Cijkl = Ui,„UknC' UjpUls, 



(3.33) 



Below are the strain tensor examples for silicon grown epitaxially on (001), 
(1 10), and (1 1 1) SiGe virtual substrates: 



?(001) 



/lO 
1 

00- 






2Ci2 



?(110) 



en/ 

2C44 



Cl2 



Cll + Ci2 + 2C44 
Cll +2Ci2 



e(iii) 



Cll +2Ci2 
Cll + Cl2 + 2C44 

2C44 - Ci2 

Cll+Ci2 + 2C44 Cii+Ci2 + 2C44 



4C44 Cii+2C12 



v 



Cll + 2ci 



Cll +2ci2 + 4C44 

Cll +2Ci2 _ 

Cll + 2ci2 + 4C44 Ci 
Cll +2Ci2 

Cll + 2C12 + 4C44 C 



Cll + 2C12 + 4C44 
4C44 
+ 2ci2 + 4C44 

Cll +2C12 
1 + 2ci2 + 4C44 



Cl 


1 + 2C12 


+ 4C44 




Cll +2 


Cl2 


Cl 


1 + 2C12 
4C44 


+ 4C44 



+ 2ci2 + 4C44 

(3.34) 



/ 



The strain tensor contains shear components if silicon layer is grown on a (1 10) and 
(111) substrate. These components can be large for higher germanium concentra- 
tion, as shown in Fig. 3.4. 
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0.1 0.2 0.3 0.4 

Ge concentration, x 




0.1 0.2 0.3 0.4 

Ge concentration, x 



Fig. 3.4 Strain in a silicon film grown epitaxially on (110) (left panel) and (HI) (right panel) 
Sii— .yGbv as function of germanium concentration .v. All shear strain components are non-zero in 
the (111) case 



3.4.3 Uniaxial Stress 

As we have mentioned in the previous chapter, uniaxial process-induced stress is 
currently employed by the semiconductor industry. Several important examples of 
the strain and stress tensor for different stress directions are briefly reviewed below. 
It is convenient to choose the coordinate system in which one of the axes, say z, 
is parallel to the stress direction. In this coordinate system the stress tensor has only 
one non-zero component a J = P, where P is the stress magnitude. By transform- 
ing the stress tensor back to the crystallographic coordinate system, we obtain the 
following expressions in case of stress in [001], [110], [1 1 1], and [120] directions: 



/OO 0^ 
ff[ooi]= 00 
\OOPj 



'P/2 P/2 Q\ 
o-[iio]= I P/IP/IQ 

oy 



'P/l> P/2, P/3\ 
ff[iii] = I P/2 P/2 P/2 
yP/2 P/2 P/2) 



'■[120] 



P/5 2P/5 0\ 
2P/5 4P/5 

oy 



(3.35) 



The strain tensor is found from (3.35) and (3.20): 



?[001 



^12 0\ 




P 


il2 


e'[iio] = 


T 


sn) 







'^^11+^12 '«44/2 ^ 
544/2 511-1-512 

V 2si2j 
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0.1 0.2 0.3 0.4 

Ge concentration, x 



[100] Stress [GPa] 



Fig. 3.5 Left panel. Strain in a silicon film grown on a (001) Sii— .,Gev substrate as a function of 
germanium concentration x. Right panel. Strain as a function of [001] uniaxial stress. All shear 
strain components are zero 





















— ^xx - Sy 


0.01 


- 


^zz 








^. 




>v^ 


- ■ ■ ^xy 


Ge 


■ 


— 


^^^^ 


0.005 


"* . 


'"'^^^ 


^ 


i- 


Si 





^ 


K, 


k-* 




/ - 


-0.005 


^i<p^ 




^^^ 


x^/,^ 


-0.01 













-1 1 

[110] stress [GPa] 



O.OOS 
0.006 ■ 
0.004 ■ 
0.002 
■ 
-0.002 
-0.004 ; 
-0.006 : 
-0.008 









- 


— =xx = =yy = =zz 

- - =xy = Exz = =yz 


• 


r 
. • 


• 

I.I. 


I.I.I. 



-1 1 

[111] stress [GPa] 



Fig. 3.6 Strain tensor components as a function of [110] and [111] uniaxial stress in silicon and 
germanium. Contrary to [100] uniaxial stress, shear strain components aie non-zero 



h\\ + lS\2 S44/2 SaaJI ^ 

«[111] = Y I ■^44/2 S\\^lSx2 SAAJl 



?[120] 



P 

y 



Saa ■^12 + 4.^11 

5sx2} 



(3.36) 



Figure 3.5 demonstrates strain in silicon epitaxially grown on (001) Sii-jcGe^ 
substrate as a function of germanium concentration x and strain in silicon and ger- 
manium under [001] uniaxial stress. In both cases shear strain is absent. If, however, 
silicon or germanium are stressed along [110] or [111] axes, non-zero shear strain 
components appear, as displayed in Fig. 3.6. As will be shown below, this type of 
deformation plays an important role in mobility enhancement. 
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Chapter 4 

Basic Properties of the Silicon Lattice 



4.1 Crystal Structure of Silicon and Germanium 

Atoms in crystalline silicon are arranged into a repeating three-dimensional pat- 
tern - a crystal. The crystallographic unit cell of the crystal structure of silicon is 
shown in Fig. 4.1. The crystal lattice can be represented as two face centered cubic 

lattices (fee), with the cube side ao = 0.543A, where the second lattice made of 
atoms B is displaced relative to the first one made of atoms ^ by a translation vec- 
tor of ^(1, 1, 1) along a diagonal. If the atoms A and B were different, the lattice 
structure is called zinc-blend lattice structure. The crystal lattice of GaAs, AlAs, 
InAs, InP, InSb, and many other III-V semiconductors is of the zinc-blend type. 

When the atoms A and B are of the same type, like in diamond, the lattice type 
is called diamond. This is the case of the typical representatives of the group IV 
semiconductors such as silicon, germanium, and SiGe alloys in any proportions. 

The translational symmetry of a three-dimensional crystal can be reproduced by 
taking a minimal number of atoms called a basis set which is repeatedly translated 
by three primitive lattice vectors a, , z = 1 , 2, 3. In cases of silicon lattice structure 
the basis set consists of the two Si atoms A and B . The basis vectors of the Bravais 
lattice are shown in Fig. 4.2: 



ai = y 1 , »2=y , and 33 = y | 1 | , (4.1) 



where ao is the lattice constant of the relaxed lattice. The lattice is obtained by 
translation of the basis atoms set by multiples of the basis vectors and their linear 
combination: 

a = /ai -Hia2 -h^as, (4.2) 

where i, j and k are integers. In the relaxed lattice the distance from the basis atom 
at ^(1,1,1) to its four nearest neighbors is equal to a ■ -Jli/A, and the atomic 
packaging factor is \/2i7i/\6, with eight atoms per unit cell. 
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ao 



Fig. 4.1 Crystallographic unit cell (unit cube) of the diamond structure 



Besides with the translations (4.2), the diamond crystal structure is invariant 
under point symmetry operations. These symmetry operations can be represented 
as a superposition of reflections, rotations, and inversion. Inversion over a point 
can also be represented as a rotation by ir around an axis going through the inver- 
sion point followed by a reflection on a plane perpendicular to the rotation axis and 
crossing it at the inversion point. However, due to the importance of the inversion 
symmetry, it is usually considered as a separate symmetry operation. 

Let us briefly review the symmetry classification of crystal lattices. 

By definition a symmetry transformation applied to a lattice brings the lattice 
into coincidence with itself thus leaving it invariant. Any symmetry transformation 
may be decomposed into an elementary rotation about an axis, reflection in a plane, 
and translation. A rotation is defined by specifying the direction of the rotation axis 
and the angle of the rotation. A reflection in a plane is determined by specifying the 
plane. A translation by a vector displaces each point of a lattice by this translation 
vector. A combination of two symmetry transformations applied to a lattice in suc- 
cession is again a symmetry transformation since it leaves the lattice invariant. An 
inverse to a symmetry transformation is again a symmetry transformation. A set of 
these symmetry transformations, or elements, form a group. Every group contains 
one and only one identity element. 

A symmetry group possessing a fixed point common to all transformations of the 
group is called a point group. For a Bravais lattice, the totality of reflections and 
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Fig. 4.2 The primitive basis vectors of the face centered cubic (fee) lattice and the two basis atoms 
^andB 



rotations which map the Bravais lattice onto itself have a fixed point and therefore 
form a point group. This point group of the Bravais lattice is also the symmetry 
group of the crystal symmetry group which characterizes the translational symmetry 
of the crystal [1]. There are seven point symmetry groups for all crystals, or seven 
systems of Bravais lattices: triclinic, monoclinic, orthorhombic, tetragonal, trigonal, 
hexagonal, and cubic. Each system may contain several types of Bravais lattices: 
simple, volume-centered, base-centered, and face-centered. For example, the cubic 
system includes simple, body-centered, and face-centered lattice types. The relaxed 
silicon diamond structure belongs to the face-centered cubic system. In total the 
seven systems subdivide into fourteen lattice types. 

The system of the Bravais lattice and the lattice type, characterize the symmetry 
group of a crystal incompletely. In compound crystals, like III-V semiconductors, 
the primitive cell contains more than one unlike atom. The Bravais lattices built on 
the different atoms coincide, however, the equivalent points are not the same. Thus, 
inversion, which is an element of the symmetry group of the Bravais lattice, may 
not be a symmetry element of the symmetry group of a crystal. The elements of the 
symmetry group of the Bravais lattice, including translations, that transforms every 
direction in the crystal to an equivalent one form a subgroup of the initial group. 
The equivalent direction in the crystal is the direction along which all the crystal 
properties are identical. This subgroup is called the crystal class. In total there exist 
32 different crystal classes [1], 
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Even the crystal class and lattice type, however, are not sufficient to characterize 
the space symmetry of a crystal completely. Within a point group all elements have 
a common point, while for a space group characterizing the symmetry of a crystal, 
the position of each rotation element within the primitive cell must be specified. 
Therefore, apart from elements of point group symmetry, the space group contains 
translations by a vector of the length less than a vector of the Bravais lattice. For 
example, a compound lattice containing at least two equivalent atoms per primitive 
cell, which is the case of the group IV semiconductors like silicon and germanium, 
may be viewed as a system of inter-penetrating identical Bravais lattices shifted by 
this translation vector. These additional translation operations map the positions of 
each component lattice onto the identical points of another lattice. Therefore, in this 
case a space group characterizing the symmetry of a crystal contains an invariant 
translation subgroup. By dividing all the elements of the space group into co-sets 
modulo the translation subgroup, one can construct the corresponding factor group, 
which is isomorphic to the crystallographic point group characterizing the crystal 
class. Altogether, there exist 230 different space groups. 

The following symmetry operations are defined: 

E Unity operation 

n^ Cloclcwise rotation of angle 2jr/« around axis e^ 

n^ Counter-clockwise rotation of angle In /n around axis e^ 

I Inversion 

h^ Clockwise rotation of angle In /n around axis et followed by inversion 

n^ Counter-clockwise rotation of angle In/ n around axis ej^ followed by inversion 

The rotation axes Ck are grouped into five classes and defined by the following 
Miller indices: 

e, (1.0,0). (0,1,0), (0,0,1) 

e," (0,1,0), (v/3,-1,0), (-v/3,-1,0) 

e; (1,1,1), (-1,-1,1), (1,-1,-1), (-1,-1,-1) 

Cp (1,1,0), (-1,1,0), (1,0,1), (0,1,1), (-1,0,1), (0,-1,1) 

e, (1,1,0), (-1,1,0) 

Thus, the symbol nf^ stands for rotations around three axes e, and corresponds 
to three point symmetry operations. 

The set of all point operations for a particular crystal lattice structure forms a 
group. Relaxed silicon crystal lattice belongs to the Oh symmetry group [5], in 
Schonflies notation. It is characterized by the symmetry elements listed in Table 4.1. 
A more detailed description of the Oh symmetry group is presented in Chapter 7. 

The point group of the diamond structure which corresponds to relaxed silicon 
consists of 48 symmetry elements. 

Table 4.1 Symmetry elements of the point symmetry group of relaxed silicon lattices 

Point group Symmetry elements Number of elements Stress direction 

Oh £2,3+372^4+4,-12,3+372,4+4,- 48 relaxed 
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4.2 Reciprocal Lattice and First Brillouin Zone 

For a given Bravais lattice defined by the vectors a a reciprocal lattice can be con- 
structed using the vectors b. The basis vectors of the reciprocal lattice b, are defined 

by the relation: 

a, X at 
b/ = 2;r ' ^ , (4.3) 

(ai X aj) ■ 33 

where ay x a^t denotes the vector product, (aj x aj) ■ as is the volume of the 
primitive cell, and a cyclic rotation (/, J, /c) —^ {k,i,j) —^ {j,k,i) of the indices is 
assumed. Thus, the basis reciprocal lattice vectors are: 



and b3 = — 1 . (4.4) 






It follows from (4.4) that 

b,- • a/ = InSi/ , 

where Sij is the Kronecker symbol. 

An arbitrary reciprocal lattice vector is written in the form 

G= Ibi+mb2+nb3, (4.5) 

where l,m, and n are integers. 

The symmetry point group of the reciprocal lattice coincides with the symmetry 
group of the Bravais lattice, however, the type of the reciprocal Bravais lattice in 
general is not the same as that of the Bravais lattice of the crystal structure. 

The first Brillouin zone is the symmetrized Wigner-Seitz cell in the reciprocal 
lattice. To construct the Brillouin zone, one should connect a chosen point on the 
reciprocal lattice to all nearest points on the reciprocal lattice and define planes per- 
pendicular to the vectors connecting the points. These planes lie equidistant between 
the chosen and the connected point. The polyhedron bounded by these planes rep- 
resents the Brillouin zone. It follows from the construction of the Brillouin zone 
that there are no pairs of equivalent vectors inside the zone. There exist 24 different 
types of Brillouin zones. 

The first Brillouin zone represents the primitive cell of the reciprocal lattice. The 
interface boundaries of the first Brillouin in silicon zone are determined by planes 
that are perpendicular to the reciprocal lattice vectors pointing from the center of 
the cell to the 14 lattice points nearest to the origin of the cell at their midpoints. 
These 14 faces are 

O 1 O 1 O 

\k.\ + \ky\ + \k,\ = - — , \k,\ = —, \ky\ = —, and \k,\ = —.(4.6) 

L ClQ Uo do ^0 

The first Brillouin zone of silicon is shown in Fig. 4.3. 
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Fig. 4.3 First Brillouin zone of silicon crystal lattice. The six equivalent valleys of the conduction 
band of silicon along the three principal axes are also shown 



Due to the translational invariance of the lattice the energy bands are periodic in 
the reciprocal space. It is sufficient to know the energy dispersions only in the first 
Brillouin zone for band structure calculations [5]. The notion of the first Brillouin 
zone is important to define all linearly independent solutions of the Schrodinger 
equation with the periodic crystal potential briefly reviewed in the next section. 

Let us select a point k in the Brillouin zone. For each vector k there exists a 
set of symmetry operations from the space group that leaves k unchanged or maps 
it onto an equivalent vector. This set of elements is usually called the little group. 
Like the space group, the little group contains an invariant translation subgroup. By 
factorizing the translation subgroup one can construct the point symmetry group 
corresponding to the given point k. For k = this point symmetry group coincides 
with the crystallographic point group. In general, the point group for a given k is a 
subgroup of the crystallographic point group characterizing the crystal class. Points 
and lines of the Brillouin zone can therefore be classified according to their point 
symmetry groups. 

The number of symmetry elements in the point subgroup of the little group 
depends on the wave vector k. Because of the unity operation the symmetry group 
contains at least one element. If the symmetry group of a given vector k contains 
more elements than the symmetry group of neighboring points, this specific vector 
k is referred to as symmetry point. Thus, points and lines of the Brillouin zone can 
be classified according to their symmetry. 

In Fig. 4.4 the symmetry points (filled dots) and symmetry lines of the relaxed 
silicon lattice are shown. Strictly speaking, the points K and U are not symmetry 
points because they have the same symmetry as the points along the symmetry lines 
S and 5, however, it is convenient to keep special symbols for these points due to 
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Fig. 4.4 The locations of certain symmetry points and symmetry lines in the first Brillouin zone 
of silicon crystal lattice 

their symmetric positions in the first Brillouin zone. By convention, capital Greek 
letters are used for points and lines inside the Brillouin zone while capital Roman 
letters denote points and lines on the surface of the first Brillouin zone. 

Since the center point F of the Brillouin zone is mapped onto itself at any point 
operation of the crystal lattice, all symmetry operations of the lattice are included 
in the point group P{r). Because of the symmetry, it is not necessary to compute 
the dispersion relations in the whole Brillouin zone. The group P(r) determines 
the volume of a part of the Brillouin zone sufficient to obtain the energy spectrum 
of electrons in a crystal. This part is called the irreducible wedge. The number of 
symmetry elements in the symmetry group determines the volume of the irreducible 
wedge as [5] 

^irred = i^Bz/|i'(n|, (4.7) 

where j/'C/^)! is the number of elements of the symmetry group P(r). 



4.3 Particle in a Periodic Potential 



In order to describe the motion of a carrier in a periodic crystal potential U(r) one 
has to solve the corresponding Schrodinger equation for the wave function \f'(r): 

H^{r) = + U(r) ^{r) = E^{r), (4.8) 

V 2mo / 
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where otq is the electron mass, fi is the Plank constant, and E is the energy. The 
periodicity of the potential C/(r) : 

I/(r + a) = C/(r), (4.9) 

where a is the periodicity vector (4.3), which makes the Schrodinger equation invari- 
ant under any translation r ^- r + a. It means that the wave function W{r + a) 
describes the same state of a particle in a periodic potential as the wave function 
\f'(r). Therefore, the wave function 'P{r + a) must be proportional to '/'(r). The 
proportionality constant can be complex, however, its absolute value must be equal 
to one, in order to prevent an infinite growth of the wave function at large values of 
the translation vector a. It is convenient to choose the proportionality constant to be 
equal to exp(/ka), where k is an arbitrary constant vector. Then the wave function 
^{r} can be written as [3] 

'^«k(r) = e"""M„k(r), (4.10) 

where M„k(r) is the periodic function: 

M„k(r + a) = M„k(r). (4.11) 

A possibility to write the solution of the Schrodinger equation with a periodic poten- 
tial in the form of a product of a plane wave envelope and a periodic function (4.10) 
is the result of the Bloch theorem. The solution (4.10) is called the Bloch function, 
or the Bloch state, while the periodic part M«k(r) is frequently named as the periodic 
amplitude of the Bloch function, or the periodic Bloch function. 

At a fixed value of k the Schrodinger equation (4.1) has an infinite set of solutions 
numbered by an integer index n. The energy Enik) corresponding to the solution 
M„k(r) also depends on the vector k and index n. For an n fixed En(k) takes the 
values within a certain interval called the energy band. Different bands may be sep- 
arated by the energy gaps or may overlap. In the last case there is more than one 
state characterized by the same energy. If the bands intersection appears at the same 
value of the k vector, the bands become degenerate at these points, and their energy 
dispersions may develop peculiarities close to the intersection points. 

Under the translation r ^- r -|- a the Bloch function transforms as 

^ij + a) = e'^^^{r). (4.12) 

The phase factor can be multiplied by a factor exp/Ga = 1, where G is any 
reciprocal lattice vector, without altering the result. Thus, the k vector is defined up 
to a reciprocal lattice vector. However, since the wave function (4. 12) is not changed 
and describes the same state, the Bloch function must be periodic in the reciprocal 
lattice: 

•^nk+cW = "/^nkCr). (4.13) 
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It then follows that the band energy is also a periodic function in the reciprocal 
lattice: 

£„(k+G) = £„(k). (4.14) 

Therefore, all non-equivalent values of k are lying within the first Brillouin zone. 

Apart from the periodicity (4.14), the band energies £n(k) satisfy additional 
symmetry relations due to the point group symmetry of the crystal [4, 5]. Thanks 
to these additional symmetries the band structure calculations in relaxed silicon 
may only be performed on a 1/48-th part of the first Brillouin zone. The energy 
dispersion in the whole Brillouin zone is recovered using the following eight 
reflections 

En{k^,ky,k,) = En{\k^l\ky\,\k,\) (4.15) 

and six permutations 

Enyi^x-, i^y-, l^z.) ^ Ef,{Kx , liz: i^y) ^ Eii{ky, Kx, ICz) ^ 

The irreducible wedge of the fee lattice is shown in Fig. 4.4. It has six corners 






(4.17) 



connected via symmetry lines 



A:r - 


^L, 


A-.r - 


^x. 


S :X - 


^U,K, 


s :r - 


^U,K, 


Q:L- 


^w. 


Z :X - 


^ W. 



(4.18) 

Here, the points U and K can be interchanged, since they are equivalent in the 
reciprocal space. 

Due to the time-reversal symmetry [2] 

£„t(k) = E„^(-k), (4.19) 

where "I" and | stand for the spin projections. In addition, in crystals with a center 
of inversion the band energies satisfy the relation [2] 

£„t(k) = £«;(k). (4.20) 

The Bloch functions corresponding to different n and k within the first Brilloin 
zone are orthogonal [3]: 
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j ^X,{r)'P„^{r)dr = J e'*-''')'-<,k,(r)M„k(r)^r = 3„„,5(k - k'), (4.21) 

where * stands for complex conjugate, 5(k — k') is the delta-function. It follows 
from (4.10) that the periodic Bloch amplitudes M„k(r) with the same k are also 
orthogonal: 

/ M*,k(r)M„k(r)fi?v = -j^^-^Snn,, (4.22) 

where, because of the periodicity of Mnk(r), the integration is restricted to a unit cell 
The equation for the periodic Bloch function amplitude is in the form: 



(V + ikf + V{r) 



2m 



unkir) = £„(k)M„k(r). (4.23) 
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Chapter 5 

Band Structure of Relaxed Silicon 



5.1 Conduction and Valence Bands 

Intrinsic silicon is a semiconductor. The band gap Eg = 1 . 12 eV at 300 K separates 
the maximum of the last filled band called valence band from the minimum of the 
first empty band called conduction band. The minimum of the conduction band is 
shifted with respect to the maximum of the valence band situated at the F point of 
the first Brillouin zone by the wave vector .STo = 0.85 (27r)/(2o, where ao = 0.54 nm 
is the lattice constant of the relaxed lattice of silicon. Because the conduction band 
minimum is shifted with respect to the T^-point of location of the valence band 
maximum, silicon is a semiconductor with an undirect gap. 

The conduction band consists of six equivalent minima located symmetrically 
along the axes [100], [010], and [001] at a distance Kq from the F symmetry point, 
or at a distance ko = 0.15 {27t)/ao from the corresponding X-point. Close to the 
minimum of the conduction band the dispersion E(k.) is usually described within a 
parabolic approximation 

, hHk,-kof fi^{kl + k^^) 

£(k) = -^ ^ + —^ ^, (5.1) 

2m/ 2m; 

where the masses m/ = 0.916mo and nit = 0.19mo are called longitudinal and 
transverse effective masses of silicon [ 1 ] . To account for a deviation of the measured 
density of states from a purely parabolic dispersion behavior at higher energies an 
isotropic non-parabolicity correction is introduced via the expression [1]: 

£(k)(l + aisk) = -^ ^ + -^ ^, (5.2) 

2m/ 2m( 

where the value of the non-parabolicity parameter a = 0.5 eV~^ is determined phe- 
nomenologically. We will demonstrate, however, that the dispersion relations (5.1) 
and even (5.2) do not describe the modification of the conduction band of silicon 
due to uniaxial [1 10] stress, and a more complex description of the conduction band 
is needed. 
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The three highest valence bands in silicon are the so-called heavy hole band, 
the light hole band, and the band split due to the spin-orbit interaction. The heavy 
and light hole bands are degenerate at the maximum situated at the F symmetry 
point of the first Brillouin zone. The maximum of the spin-orbit split band is also 
at the r point and lies Z\ ,„ = 44 meV below the maximum of the light and heavy 
hole bands. Due to the degeneracy between the light and heavy hole bands their 
dispersions may not be described within a parabolic band approximation due to 
their non-analytical behavior near the maximum. A six-band k-p theory [19] usually 
employed to describe the bulk dispersions in the valence band includes the spin-orbit 
interaction between the three valence bands and is thus a good approximation. 

The band structure of a crystal material can be qualitatively evaluated by using 
symmetry of a crystal [31]. For example, due to the point symmetry of the face- 
centered lattice the six directions [100], [100], [010], [OlO], [001], and [001] are 
equivalent. Consequently, the six conduction band valleys shown in Fig. 4.3 are 
equivalent. However, in order to find the correct dispersions, the solution of the 
Schrodinger equation (4.1) with the periodic crystal potential is required. This is 
the subject of electronic band structure calculations. The periodic potential consists 
of the core potential contribution from the ions forming the crystal and the elec- 
tronic contribution. Both parts represent a challenge in describing the band structure. 
The core Coulomb potential is singular at the position of an ion. This singularity 
makes the wave function strongly oscillating close to the core thus requesting for 
a special stable numerical methods capable to capture such oscillations. The elec- 
tronic part is the result of action of all the electrons on a particular carrier and must 
be found self-consistently. It turns out that the so-called correlation corrections to 
the self-consistent Hartree field are very important in determining the band struc- 
ture correctly. This is why accurate band structure calculations represent a serious 
computational challenge even nowadays. 

Electronic band structure calculations can be subdivided into the three dis- 
tinct groups: first-principle, empirical numerical, and perturbative analytical meth- 
ods. Methods based on the first-principle calculations, or ab-initio, are of great 
importance because they do not require any phenomenologically adjustable fitting 
parameters. These methods are based on the density-functional theory. 



5.2 First-Principle Band Structure Calculations 

The Hohenberg-Kohn theorems establish the ground of the density-functional meth- 
ods. The first theorem states that the ground state density of a system of interacting 
particles in an external potential is uniquely determined by the external potential. 
Alternatively, the profile of the external potential is uniquely determined by the den- 
sity of the system of interacting particles in the ground state. It means that there is 
one to one correspondence between the ground state density and the external poten- 
tial. Therefore, the Hamiltonian describing the many-body system is completely 
determined by the ground state density. Thus, all the properties of the system in the 
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ground and excited states can in principle be determined provided that the density 
in the ground state is known. 

The second theorem states that the energy of the ground state of a system of 
interacting particles can be calculated as a minimum of a universal functional for 
the energy E[n] which depends on the density n(r) [13]. However, the theorem does 
not provide a recipy on how to build such a functional for a system of interacting 
particles. 

In 1965, Kohn and Sham [16] proposed an approach which has enabled practical 
electronic structure calculations. The idea of the Kohn-Sham approach is to replace a 
complicated system of interacting particles with a simpler system that can be solved 
exactly. The assumption that Kohn and Sham employed is that an auxiliary system 
consists of an ensemble of non-interacting particles. It is assumed that the ground- 
state density of the ensemble of non-interacting particles is equal to the density in the 
original interacting system. By solving the equations for non-interacting particles 
one finds the density and the ground-state energy of the original system: 

E[n] = T[n] + VhM + f V^^ndr + £,,[«], (5.3) 

where Vexi is the external potential, 

N 

n{r) = Y,^*{r)fi{r) (5.4) 

is the density on an A^ -particle system, i/f,(r) are the Kohn-Sham orbitals, T[n\ is 
the kinetic energy: 

T[n] = J2f *V^*W (-^) ^' W' (5-5) 

Vh is the Hartree contribution: 

e^ f , f , ,«(r)n(r') 
VH[n] = -jdrjdr'j-L}-l^ (5.6) 

e is the electron charge, and Ejcdn] is the unknown exchange correlation energy 
functional. 

The Kohn-Sham single particle equations are obtained by varying the energy 
functional (5.3) with respect to a set of orbitals i/f, : 

^ V2 + Kff(r) j ifi (r) = Ei ifi (r), (5.7) 



2m 
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where 

Keff(r) = V,Ar) + Vh (r) + ^^^ . (5.8) 

d«(r) 

After the equations for the single-particle orbitals are resolved, the wave function 
of the many-body system is build as a Slater determinant. 

In the Kohn and Sham approach, all the difficult many-body terms beyond the 
external potential and the self-consistent Hartree field are put into the exchange- 
correlation potential. Therefore, the accuracy to find the exact ground state density 
and the energy are only limited by the approximations in the exchange-correlation 
functional. The exchange correlation functional is unknown and can be constructed 
approximately. The local density approximation or generalized-gradient approx- 
imations are quite accurate in describing the ground-state properties. However, 
the challenge is to develop density-functional-based methods to describe accu- 
rately excited state properties. The applicability of local density approximation 
to excited properties is limited due to several reasons. First, the approximate 
exchange-correlation potentials are jellium-based, thus, they incorporate an artifi- 
cial self-interaction. The discontinuity of the exchange-correlation potential with 
respect to the number of carriers is also neglected. The incomplete cancellation of 
the self-interaction is the reason why the gap as well as the properties of localized 
states are incorrectly predicted. 

The exact-exchange based density-functional theory in the optimized effective 
potential approach is free from the self-interaction error and greatly improves the 
Kohn-Sham energies with respect to the quasi-particle excitations [25]. In spirit of 
the Kohn and Sham theory, the total energy of the system is divided into known and 
unknown parts. In addition to the Kohn-Sham approach, the Fock exact-exchange 
energy contribution is isolated in the exchange-correlation functional: 

Ex[n] = --}_^J drj dr ^^-^^ . (5.9) 

For the occupied / = j states the exact-exchange term (5.9) cancels exactly the 
self-interaction contribution from the Hartree energy functional (5.6). Varying the 
energy functional E[n] = Ee^i[n] + Vnln] + Ex[n] with respect an orbital one 
obtains the set of the Hartree-Fock equations for ^, (r): 

where Up (r, r') is a non-local operator: 

e2 ^ /r)Vf*(r') 

SF{r,r') = --J2 ; ' , ■ (5.11) 

2 '^—' r — r' 

An effective local exact-exchange potential can be derived by making the vari- 
ation of (5.9) with respect to the density n(r) and applying an optimized effective 
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potential approach [6, 20]. The local correlations can be easily added into the exact- 
exchange potential-based density-functional theory by including the local density 
correlation energy [22] which improves results for the band gap. 

To obtain accurate gaps for a large set of solid materials a more advanced 
many particle method called the GW approximation is currently employed. This 
method allows to evaluate the self-energy E of an interacting system in terms of the 
single-particle Green's function G and the dynamically screened Coulomb interac- 
tion W [10]. Using the self-energy S the Green's function can then be evaluated, 
which allows to address the band structure and the spectral properties of the sys- 
tem. Because in the GW approximation the spectrum of elementary excitations, or 
quasi-particles, is evaluated the method gives much better results for the gaps of 
semiconductors and insulators, when it is applied after DFT calculations [25]. The 
GW approximation is implemented in several packages for ab-initio calculations. 
We have used the Vienna Ab-nitio Simulation Package (VASP) [29] to obtain the 
results for the silicon band structure. 



5.3 Pseudopotential Band Structure Calculations 

The first-principle band structure calculations, especially with the GW approxima- 
tion included became available only recently and are typically applied to study the 
band structure of new or unstudied materials with not very well known proper- 
ties. The band structure of silicon is a subject of research for a long time and is 
well known experimentally. Several approximate numerical methods to evaluate the 
band structure have also been developed. The method of empirical pseudopotentials 
allows to reproduce all the characteristics of the band structure known experimen- 
tally including the gap, the spin-orbit split-off energy, the effective masses, the 
non-parabolicity parameter. The method employs only a relatively small set of 
parameters which can be calibrated to reproduce the properties of several materi- 
als. Below we follow the method of non-local empirical pseudopotentials of Rieger 
and Vogl [24], which allows to describe the band structure of silicon, germanium, 
and SiGe. 

Pseudopotentials were first introduced by Fermi [9] to investigate high-lying 
atomic states. The idea of the pseudopotential method of band structure calcula- 
tions is quite simple. Electrons on a silicon atom can be divided into a group of 
electrons localized on the ion core and a group of valence electrons involved in 
binding to nearest silicon atoms. The valence electrons are from the outer partly 
filled shells of silicon atoms and are nearly free. While moving in the crystal these 
electrons do not feel the pure Coulomb potential of the ion core. Instead, because 
the core is dressed by the localized electrons, the valence electrons are subject to 
the screened core potential. In order to describe the properties of silicon due to 
these nearly free valence electrons it is enough to approximate in the vicinity of 
the core the real singular Coulomb potential by a soft effective potential. The soft 
effective potential changes the behavior of the wave function close to the ion core 
removing the strong oscillations in the singular Coulomb potential. This allows to 
resolve the Schrodinger equation numerically. At the same time, the properties of 
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Fig. 5.1 The idea of the pseudopotential method is to replace the Coulomb potential by a smooth 
pseudopotential within the core region [31] 



the nearly free valence and free conduction electrons are determined by the behav- 
ior of the wave function outside the ion cores and are therefore not affected by the 
substitution of the real core potential by a smooth effective potential. The effective 
potential called the pseudopotential is schematically shown in Fig. 5.1. 

In order to construct the pseudopotential. Herring [ 1 2] has suggested to approx- 
imate the crystal wave function i/'^k(r) of the valence or conducting electrons as a 
linear combination of a smooth function ^k(r) augmented by the core states Wj^ 
localized at the ions: 



^k(r) = .^k(r) - ^('/'yk l-^k) ^Mr), 



(5.12) 



where the coefficients are chosen from the condition that the crystal wave function 
i/fk(r) is orthogonal to the wave functions of the localized electrons: ('/'/klV'^k) = 0. 
The smooth wave function 0k (r) satisfies the following equation: 






2m 






0k(r) = £(k)0k(r), 
(5.13) 
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where it is taken into account that for the localized states 

. A_V2 + Vc{r)\ ^Mr) = Ej^jk. (5.14) 

According to the cancellation theorem [23], the value of the pseudopotential 

V{r) = Vc(r) + y- 7M jkivk/ jky j ^^^^^ 

j ^ 

is small in the core region, while it has the correct Coulomb behavior e^/r far 
from an ion. Therefore, (5.13) determines a pseudo-wave function <^k(r) which has 
a smooth behavior in the core region. Thus, the pseudo-wave function is a good 
approximation to the true wave function outside the core region and can be used, 
for example, to evaluate transport characteristics which depend on the properties of 
the valence and conduction electrons. Regardless the fact that the pseudopotential 
Schrodinger equation is for the pseudo-wave function and not the real crystal func- 
tion, the eigenenergies in (5.13) correspond to the true energies of the crystal wave 
function. 

The pseudopotential depends on the solutions of (5.13-5.14) and is therefore 
not known. It can, however, be approximated by a function with a few parameters. 
The parameters can be adjusted later to reproduce the known characteristics of a 
semiconductor, like the energy gap, effective masses of electron and holes. A simple 
but efficient approximation due to Heine and Abarenkov for a pseudopotential of a 
single atom is to assume it coincides with the correct Coulomb potential outside the 
core, while the behavior of the pseudopotential is modeled by a constant within the 
core region. This constant may depend on energy E. The so called empty core model 
is obtained, when this constant is set to zero, while the constant effective potential 
model is recovered, when the core potential is equal to the Coulomb potential at the 
core distance R from the ion. 

By its definition (5.15) the crystal pseudopotential depends on angular momenta 
present in the core states and the energy E [7]. Inspite of the fact that the pseudopo- 
tential is non-local, many properties can be found by assuming K to be a simple 
function of position r. The pseudo-potential can be constructed from the single-ion 
pseudopotentials Fo(r): 

V{r) = Y, (^o(r - Ri + T) + Ko(r - Rj - t)) , (5.16) 

where the summation is over the lattice vectors R,, and r = a/8(l,l,l)is the 
vector determining the positions of the two silicon basis atoms in the unit cell. The 
pseudopotential is a periodic function and can be expanded into a Fourier series over 
the reciprocal lattice vectors: 

V{r) = J2 nGj) exp(/rGy). (5.17) 
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Importantly, the coefficients V{Gj) are expressed in terms of the Fourier harmonics 
of the ion potential Vb(r) alone: 



V(Gj) = 2Vo(Gj)S(Gj), 
where i2 is the volume of the unit cell. 



(5.18) 



^o(G;) = ^j drVo{r)expi-irQj), 



and S{G) is the structure factor of a diamond lattice: 



1 



S{Gj) = - {exp(irGj) + sxp{-irGj)) 



(5.19) 



The crystal pseudopotential can also be easily constructed for III-V semicon- 
ductors with the zinc-blend lattice crystalline structure. In this case two types of 
pseudopotentials I^iii(r) and Kv(r) for the two sorts of atoms must be introduced. 
The periodic potential can be again developed into the Fourier series in the form 
(5.17), where the coefficients 1^(G) are: 



V{G) = cos(tG)F,(G) + / sm{rG)Va(G), 



(5.20) 



Vs = Fill + Vy and Va = Vm — Vy are the symmetric and antisymmetric form 
factors. In the case of diamond lattice, when Vm = Vy, (5.20) coincides with (5.18, 
5.19). 

Because pseudopotentials do not have singularities within the core region and 
decay fast outside of the core, they can be considered as weak perturbations. There- 
fore, the plane wave basis | k) = 1 / \^ exp (/ kr) is suitable to find the pseudo- wave 
function ^k(r): 

<^k(r) = ^fl,|k + G,), (5.21) 

where the summation is over reciprocal lattice vectors Gj. Substitution of (5.21) 
into (5.13) results in the following system of homogeneous equations: 



(k + G^ 
2mn 



E(k) 



ai + ^ ViGi - Gj)aj 



(5.22) 



In order the non-zero solution a, exists the determinant of the matrix made of 
the coefficients in (5.22) must be set to zero. The corresponding secular equation 
determines the eigenenergies E(V.) in the crystal: 



det 



(k + G,)^ 
2mo 



-£(k) 



&G,,Gj + V{Gi-Gj) 



= 0. 



(5.23) 
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^ q 



Fig. 5.2 Fourier-transform of the pseudopotential rapidly decreases at large q [31]. Due 
to its periodicity the pseudopotential is determined by the Fourier-harmonics at |G^| = 
(2jr/a)2x (0,3,4,8,11,12,...) 



In the local pseudopotential method the ion pseudopotential is assumed to be 
spherically symmetric: Fo(r) = ^o(|r|)- In this case the form-factors depend 
on the absolute value of the reciprocal lattice vector G,. This assumption dras- 
tically reduces the number of form-factors needed in (5.23). These from-factors 
can be considered as free parameters which can be adjusted in order to accurately 
reproduce the band structure. Because of the periodicity of the reciprocal lattice 
the pseudopotentials are only needed at discrete G points. The square of the dis- 
tance from the origin to each equivalent set of reciprocal lattice sites is equal to 
IG^I = (In/af X (0, 3, 4, 8, 11, 12, . . .) [31]. Because of the Fourier components 
Voiq) decrease fast (typically as l/q'^) with q increased, one can neglect the pseu- 
dopotential form factors for G^ > ll(27r/a)^, as shown in Fig. 5.2. This is the 
manifestation of the fact that the pseudopotential is a smooth function of r, thus 
it can be approximated with only a few Fourier harmonics. For the diamond lat- 
tice structure there are only three relevant parameters V{2jry/3/a), V{2Ti\/%/a), 
and V{27T\f\\/a). The value K(0) gives only a rigid shift in energy and can be 
set to zero, while the value V{ATt/a) = because of the vanishing structure factor 
(5.19). The corresponding form-factors for silicon and germanium are listed in the 
Table 5.1 [24,27]. 

As it was pointed out by Chelikowsky and Cohen, the local pseudopotential 
approximation is not sufficient when describing the properties of holes, namely 
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Table 5.1 Pseudopotential parameters used in calculations [24,27] (1 Rydberg = 13.6 eV) 



Parameter 


Ge 


Si 


Units 


F/ 


-0.221 


-0.2241 


Ry 


vi 


0.019 


0.0520 


Ry 


vA 


0.056 


0.0724 


Ry 


Ao 


0.0 


0.03 


Ry 


A2 


0.275 


0.0 


Ry 


Ri 


1.22 


1.06 


A 


IJ-so 


0.000965 


0.00023 


Ry 


? 


10.0911 


8.0 


A-' 



the density of states in the valence band. To correct the behavior, an energy 
dependent non-local correction term K„; is usually added to the local atomic pseudo- 
potential [7]: 



Vni(r,E) = Y,ME)fi(r)Pi, 



(5.24) 



l=\ 



where Ai is the energy-dependent constant, that can also be interpreted as the well 
depth [7], // (r) is the function approximating the core wave function with / symme- 
try, and Pi is the projection operator for the / angular momentum. Using the series 
expansion of the plane wave 



expO'qr) = '^i^ Pi(cosa)ji(qr), 



(5.25) 



/=o 



where P/ (cos a) are the Legendre polynomials, a is the angle between q and r, and 
jl iy) are the spherical Bessel functions of the first kind, the matrix elements of the 
non-local pseudopotentials can be written in the following form: 



47t 



(k+G||F„/||k' + G)= — ^(2/ + l)/'Kcos^)5(k-k') (5.26) 

1=0 

00 

xjdrr^fl(r)ji{\k+G\r)jiQk' + G\r). 



It turns out that only the components with / = 0, 1, 2 are important. For silicon, 
the core wave function is approximated by a Heaviside step function (0(x) = 1 for 
X > 0, 0(jc) = Oforx <0) 



fl{r) = 0{Ri-r), 
while for germanium the core wave function is a Gaussian [24]: 



(5.27) 
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/,(r) = exp(-^J. (5.28) 



For the correct description of the valence band of silicon and germanium, the 
spin-orbit interaction responsible for the band splitting must be taken into account. 
The spin-orbit coupling is the result of the interaction of the electron spin a with the 
magnetic field B produced by its orbital motion. The spin-orbit Hamiltonian H^o is 
in the form [31]: 

/^.™= j^[VKxp]ff, (5.29) 

where V is the potential energy, p = — /^V is the momentum operator, and a = 
(CTx , <^y , o'z) ^ is the vector made out of the Pauli matrices: 

Substitution of the Bloch function (4.10) into (5.29) results in the following operator 
acting on the periodic amplitude of the Bloch function: 



Hso=-^^[yVxip + k)]a, (5.31) 

4wqC"^ 

The term with the crystal momentum k is small as compared to the term with p. This 
is because the momentum of the electron on a localized atomic orbital described by 
the term < m„|p|m„ > is much larger than the typical crystal momentum k. This 
also explains why the spin-orbit interaction is more important for valence electrons 
which are more localized than the conduction electrons and thus move faster on 
their orbitals. The matrix elements of the spin-orbit interaction can be presented in 
the form [24]: 




VsoiKs, K's') = --^ I ^^ — 1 SiK-K')B(K)B(K')[K x K'] (^1 a \s') , 

(5.32) 



where 

B(k) = 



5il + {K/m'' 

and Hso is the spin-orbit parameter. The values of parameters used in the empir- 
ical pseudopotential band structure calculations are summarized in the Table 5.1. 
For germanium we have used the parameters from [24]. The spin-orbit interaction 
parameters piso and ^ for silicon are taken from [27]. 

In order to extend the empirical pseudopotential method on Sii-xGcjc alloys the 
virtual crystal approximation is applied. The form-factors and the parameters of the 
non-local and spin-orbit interactions are interpolated linearly in the mole fraction x. 
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The expression for the lattice constant is taken in the following form [24]: 

a(x) = ao + 0.0200326x(l — x) + (aoe — ao)x^ , 
where a Ge = 0.5658 nm. 

5.4 Semi-Empirical Tight Binding Method 

The pseudopotential method is based on the separation of all the electrons into 
two groups: strongly localized core electrons and nearly or completely delocalized 
valence or conducting electrons. This allows to use the plane wave basis to describe 
the band structure by the empirical pseudopotential method. On the contrary, the 
tight-binding approach explores the band structure from another limiting case and 
uses the localized atomic orbitals as the basis function. Thus, this approach is 
expected to give good results for the valence band structure and somewhat less accu- 
rate results for the conduction bands. The reason that the two opposite approaches do 
work reasonably well is that the electrons in silicon are delocalized in the conduction 
band and are thus well described by the delocalized basis functions while the valence 
electrons are mostly localized in the bonds and may be described using the local- 
ized basis states. The atomic wave functions are usually classified by their orbital 
momenta. The states with the orbital momentum / = 0, 1, 2 are usually designated 
ass, p,d, correspondingly. Symmetrically orthogonalized atomic orbitals (pi (r — R) 
centered at the atom position R called Lowdin orbitals [ 1 8] are conveniently used as 
the pseudo-atomic basis states. They satisfy the one-particle Schrodinger equation 
with the Hamiltonian h{r — Ri): 

h(r-Ri)cP,{r-Ri) = £;.^,(r- R,). (5.33) 

The introduction of the Lowdin orbitals allows to get rid of the troublesome overlap 
integrals between the orbital states ai (r— R) of the two atoms located at the positions 
R,- and Ry 

S-r^ = f dra„,(r - R,)a„(r - R,) (5.34) 

by the following transformation [8]: 

0/(r-R,) = Y, [S-^")^^^^ a^(r-Rj). (5.35) 

mj 

When the atoms are assembled into the crystal lattice, the total Hamiltonian is the 
sum of the single atom Hamiltonians h{r — R, ) and the term due to the inter-atomic 
interaction 8H{r): 
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Hir) = J2Hr- R/) + SH(r). (5.36) 

The inter-atomic interaction SH(r) is small and can be taken as perturbation. By 
neglecting SH{r) the Bloch function of the crystal is written via the Lowdin orbitals 
in the following form: 

(Pmkir) = \mk) = ^Y exp(/kRy),^„(r - Rj), (5.37) 

J 

where A^ is the number of primitive cells. The wave functions (5.37) satisfy the 
properties 

^mk(r + a) = e'^^0mk(r). 

Using the orthonormaUty of the pseudoatomic Lowdin orbitals, the orthogonality of 
the Bloch functions (5.36) is ensured. 

The functions (5.37) are used to expand the solution '/'t of the Schrodinger 
equation with the total Hamiltonian (5.36): 

tf'k(r) = ^C«^«k(r). (5.38) 

m 

This leads to the following equation for the energy dispersion E{k.) 

det[H -£■(]«)] = 0. (5.39) 

The matrix elements of the Hamiltonian from (4.8) computed with the basis func- 
tions (5.36) are [26]: 



H,„ik) = (/k| H Ink) = J2 exp(/(R;k) f dr<p'[{r)H(j>„{r - Ry). (5, 



40) 



The overlap integrals 

/,„(Ry) = j drcp*ir)H(P„(r - Ry) (5.41) 

are the parameters of the method. At Ry = they describe the on-site orbital ener- 
gies (I = n) and hopping parameters between different orbitals at the same atom 
I ^ n, while for Ry ^ they represent hopping parameters for electrons between 
different sites. In order to reduce the number of parameters, several approximations 
are typically applied. 

First, only a few orbital functions are included. The name of the model is typi- 
cally related to the number of orbital functions preserved. The most commonly used 
nowadays sp^d^s* model employs one s, three p, five d, and one "excited" s* 
orbital. The dimensionality of the matrix H in (5.36) is determined by the number 
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of orbitals included. In case of several different basis atoms in the primitive cell the 
dimension of the matrix is correspondingly augmented [30]. 

The second simplification is based on a strong localization of the Lowdin orbitals. 
The overlap integrals decay rapidly with the distance between the atoms increased, 
which allows to restrict the summation in (5.38) only to the nearest neighbors or the 
next nearest neighbors. 

Several semi-empirical tight-binding models are available in the literature, with 
different number of orbitals and order of neighbors included: sp^s* with the spin- 
orbit coupling [4], the second-near-neighbor spin-orbit model [2, 3], second- and 
third-near-neighborwithout spin-orbit 5/7^ model [1 1,17]. The next-nearest neighbor 
sp^d^s* model [5, 14] accurately reproduces the effective masses and gaps at F, L, 
and X symmetry points of the Brillouin zone. The next-nearest-neighbor model 
is easier adapted to strained nanostructures [15,21]. The model was recently cali- 
brated to accurately reproduce the valence band of silicon and germanium [5]. The 
parameters used are from [5] and are summarized in Table 5.2. 



5.5 Comparison Between Different Numerical Methods 

We briefly compare the numerical methods of band structure calculations. Fig. 5.3 
demonstrates the band structure results obtained with the empirical pseudopoten- 
tial method (EPM) and the sp^d^s* tight-binding method. For pseudopotentials we 



Table 5.2 The parameters of the sp^d^s* model in Slater-Koster notation [26] for Si and Ge 
from [5] used in band structure calculations. A is the strength of the spin-orbit coupling. All the 
values are in eV 



Parameter 


Si 


Ge 


Es 


-2.15168 


-1.95617 


Ep 


4.22925 


5.30970 


E,* 


19.11650 


19.29600 


Ed 


13.78950 


13.58060 


X 


0.01989 


0.10132 


ssa 


-1.95933 


-1.39456 


s*s*a 


-4.2435 


-3.56680 


ss*a 


-1.52230 


-2.01830 


spa 


3.02562 


2.73135 


s* pa 


3.15565 


2.68638 


ppa 


4.10364 


4.28921 


ppn 


-1.51801 


-1.73707 


pda 


-1.35554 


-2.00115 


pdn 


2.38479 


2.10953 


dda 


-1.68136 


-1.32941 


ddjT 


2.58880 


2.56261 


ddS 


-1.81400 


-1.95120 
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Fig. 5.3 Band structure from the EPM (solid) and from the sp^d^s* model (dashed) 



use the parameters from [24]. The parameters of the pseudopotentials were cho- 
sen to reproduce the measurable quantities of silicon, like the energy gap and the 
effective masses. In addition, the spin-orbit coupling was included and calibrated 
to obtain the split-off band correctly [27] as explained in the subsection describ- 
ing the pseudopotential method. Results of the band structure calculations obtained 
with the empirical pseudopotential method and sp^d^s* method with the param- 
eters from [5] are compared in Fig. 5.3. It demonstrates that both methods agree 
reasonably well. 

The valence band of silicon was reproduced reasonably well with a much sim- 
pler sp^s* tight-binding model [8]. Recent development of a more sophisticated 
sp^d^s* model [5] containing additional parameters has further improved the 
reproducibility of the valence band. It also allowed to obtain the correct effec- 
tive masses for the conduction band of relaxed silicon. However, the conduction 
band minimum in the sp^d^s* model is further away from the X point than in the 
empirical pseudopotential calculations, where the valley minimum is located at the 
distance ko = 0.15^ from the X point (0.85|^ from the F symmetry point). 
This leads to an almost two times larger gap between the two lowest conduction 
bands at the valley minima (Fig. 5.4) compared to A = 0.53 eV found from the 
EPM. Because of this the sp^d^s* tight-binding model slightly underestimates the 
anisotropy of the conduction band. The effect is clearly visible in Fig. 5.5, which 
demonstrates that the EPM gives a more pronounced conduction band warping than 
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> 




Fig. 5.6 Comparison of bulk dispersion relations close at the minimum of the [00 1 ] valleys of the 
conduction band in [100] and [110] directions. DFT [29] and EPM [28] results are similar, while 
the sp^d^s* tight-binding model [5] slightly underestimates anisotropy 



the sp^d^s* tiglit-binding model. As follows from Fig. 5.6, the EPIVI results are 
the most accurate, when compared to the first principle density-functional band 
structure calculations obtained with the VASP [29]. 
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Chapter 6 

Perturbative Methods for Band Structure 

Calculations in Silicon 



6.1 The k'p Method for a Non-Degenerate Band 

The methods of computing the band structure considered in the previous chapter 5 
are intrinsically numerical. In some cases the analytical expressions for the band 
structure close to the extremum points are needed. The k-p method is based on the 
perturbative approach and allows to obtain the analytical band structure close to a 
chosen point provided the eigenenergies and eigenfunctions at this point are known. 
The idea of the method was formulated in the fundamental work by Luttinger and 
Kohn[ll]. 

According to the Bloch theorem, the solution of the Schrodinger equation with 
the periodic potential V{r) 

HHr) = f ^ + V(r)\ 4>{r) = £</.(r) (6.1) 

is in the following form: 

^nk(r) = e''''"M„k(r). (6.2) 

where n is the band index, k is the wave vector in the first Brillouin zone, and 
"nk(r) is the periodic Bloch amplitude. The wave functions (6.2) form the complete 
orthogonal basis set. This set is not unique. As it has been demonstrated by Luttinger 
and Kohn in their seminal work [11], the functions 

?«k(r) = e'*~''o'->„ko(r). (6.3) 

also represent a complete orthonormal basis set provided the set of the functions 
V'^«k(r) is orthonormal. Substituting the wave functions (6.3) into (6.1), one arrives 
to the following equation: 

\2mo mo J \ 2mo J 
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Assuming that the solution of the (6.4) at k = ko is known, 

Hofnko = ( T \- V(r) J V^„ko = E„u,^f„i,Q, (6.5) 

the eigenenergy £„k in the vicinity of Enko can be found by treating the term 

^(k-ko)-p 



H, 



mo 



as a small perturbation to the unperturbed Hamiltonian Hq defined in (6.5). 

At the r symmetry point ko = (0, 0, 0), and (6.4) is recast into a more familiar 
form 

f P^ hkp \ ( h^k^\ 

\ \- V{r) M„ko = [Enk- M„ko- (6.6) 

V2mo mo / V 2mo / 

Assuming that the solution of the (6.4) for ko = (0, 0, 0) 

( Z h ^(r) J M„0 = EnOUnO, (6.7) 

is known, the solution of (6.6) can be obtained by developing a perturbation expan- 
sion over — —. It is due to this term that the method has the name k-p, or "k dot p" 

mo '^ '^ 

theory. The type of the quantum mechanical perturbation theory to apply depends on 
whether the bands for which one would like to find the expressions are degenerate 
at the ko = (0, 0, 0) point or not. 



6.2 Effective Mass Theory for Non-Degenerate Bands 

Let us assume that the band n has an extremum (a minimum or a maximum) at 
the point k = 0. A generalization to the case when the extremum is at k = ko is 
straightforward [11]. Let us also assume that the Eno is non-degenerate and is well 
separated by energy gaps from other bands at the point k = 0. Treating the — — term 
as a perturbation, we find the energy in the first order of the perturbation theory: 



E„i, = E„o-\ (M„o|k-p|M„o> • (6.8) 

mo 

If the point k = ko is an extremum (a maximum or a minimum), then the first order 
correction (6.8) to the energy is zero, 

E„k = E„o, (6.9) 
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and one has to proceed to the second order perturbation theory. To do so, the first 
order perturbation theory for the wave function is applied: 

M«k = U„o-\ > -; M„'o- (6.10) 

mo "rf E„o - En'Q 

n'f=n 

The expression for the wave function (6.10) is used to obtain the energy correction 
due to the — - term in the second order perturbation theory: 

r F , fl^k^ , fl^ v^ |(Mno|k-p|M„/o)P 

i-nk = Eno + 1 J y ^ ;; ;; ««'o- (6.11) 

2mo mi ^ E„o - E„'o 

By defining an inverse effective mass tensor m^".^ as the derivative 



1 _ 9 Enk 



one gets 

mij = — bij + ^ > • (6.12) 

mo mg ^, Eno - E„'o 



The energy dispersion (6.1 1) can be recast into 

E„k = Eno + -Y^^^kiin-^kj = yk'^/w-'k (6.13) 

ij 

where ?w~Ms the effective mass tensor. It follows from the expression for the effec- 
tive mass that the bigger the energy gap between the bands the smaller is their 
contribution into the effective mass. It should be pointed out that all non-zero matrix 
elements (M„ko |k • p|M„'ko) contributing to the effective mass can be found from the 
matrix element theorem [16] by symmetry group considerations. The dispersion 
relation (6.13) is valid for the energies 

|£«k-£«o| «: min|£„o-£'«'o|, n 7^ «'. (6.14) 

If the condition (6.14) is violated, the non-degenerate perturbation theory fails 
and must be replaced by its degenerate version. An example of the description of 
the band structure in silicon based on the degenerate perturbation theory will be 
presented below. 
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6.2.1 Electron Effective Mass in Relaxed Silicon 

The electron band structure consists of six equivalent valleys shown in Fig. 4.3. The 
conduction band minima of silicon reside on the (001) axes at a distance of 0.15 — 
from the X symmetry points of the first Brillouin zone. The lowest conduction band 
A I is well separated from the nearest second conduction band A2' by the energy 
A = 0.53 eV. Thus, the non-degenerate perturbation theory can be applied to find 
the dispersion of the lowest conduction band. Close to the minima the dispersion is 
parabolic. The eigenvalues E„h are: 

£„k = E,,^ + J-2^k, S.j kj + -j^^ k, ., _j, kj, (6.15) 



where 



[unVo\Pj |M«'ko)= -^ / M«koT^ — Un'kodr . (6.16) 



Q 



Because iSn,ko is the minimum, the linear terms in A:, is zero. The expression for the 



effective mass tensor 7?i* • is: 



= — + :^l^ r. _F,. ■ (^-i^^ 



K,ij ^0 ml ^„ Enko-E„'ki, 



The effective mass tensor for the lowest conduction band A 1 is symmetric and can 
be diagonalized. In the principal coordinate system the tensor is diagonal and is 
characterized by the two transversal masses mt and the longitudinal mass mi. For 
the [001] valleys, or the valleys along the z axis the expressions for the effective 
masses are: 

_]_ ^ J_j^ ^ y^ \{uAtko\Pz\u„'kQ)\^ 
mi mo ml J^^^ EA^k^- En'\,^, 

and 

1 1 2 Y^ \{uA,ko\Px\Un'kQ)\^ ,, ,„, 

^t mo ml J^^^ EA^ko-En'ko 

Here Ai denotes the band index n of the lowest conduction band. Therfore, the 
energy dispersion can be written as: 

£(k) = — !^^^ ^^^^^ + — ^-^ ^. (6.20) 

2m; 2ot; 
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The relation (6.20) coincides with the well-known effective mass approximation for 
the energy dispersion of electrons in the conduction band of silicon (5.1) around the 
minimum located at k„,j„ = ko To evaluate the values of the effective masses, the 
knowledge of the Bloch functions and the band energies at the minimum point 
is required. These values, and thus the masses, can be computed by the band 
structure calculation methods described in the previous chapter. The values of the 
effective masses in silicon are well known experimentally. Therefore the masses 
mi = 0.91oto and mi = 0.196mo can be used to calibrate the parameters of 
semi-empirical band structure calculation methods. 



6.2.2 Approximations for the Conduction Band Dispersion 
at Higher Energies 

At higher energies the deviation from the pure parabolic dispersion appears. In 
order to take into account deviations in the density of states from that given by 
the parabolic dispersion a more general dispersion similar to (5.2) including band 
non-parabolicity is typically used for the [001] valleys [9]: 

£'(k)(l + a£'(k)) = — ^-^^ ^^^^ -t- — ^—^ ^—. (6.21) 

2m\ 2mt 

The value of the non-parabolicity parameter a = 0.5 eV~' is determined experi- 
mentally. However, the non-parabolic dispersion (6.21) is isotropic in (001) plane. 
For this reason it can only predict the modification of the density of states and can 
not describe the conduction band warping. As it was recently pointed out in [17], 
a more general description is needed to describe the subband structure correctly in 
(110) oriented ultra-thin silicon films. The conduction band model must take into 
account an anisotropy of the dispersion of the [001] valleys in the (001) plane. To 
derive a more accurate dispersion relation for the conduction band, the second con- 
duction band Aj' {i = 2) closest to the first conduction band Ai (i = 1) must 
be taken into account. We consider the pair of equivalent conduction band valleys 
along the [001] direction. Other valleys can be analyzed analogously. 

The two conduction bands A i and A2' become degenerate exactly at the X point. 
Since the minimum of the conduction band is only ko = 0.15— away from the X 
point, the dispersion around the minimum can be well described by the degenerate 
perturbation theory, which includes the two bands degenerate at the X point. Diago- 
nal elements of the Hamiltonian Ha ,i = 1 , 2 at the X point include the contribution 
of all the remote bands and can be easily obtained using the standard k-p theory [2]: 

n ,■ , ti fi^k^ h^kl fi^kl 

Hf,(k) = {-\y-'—k,p+-^ + —^+—^, (6.22) 

mo 2m/ 2m; 2m; 
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where mo is the free electron mass, rrit is the transversal, and nii is the longitudinal 
effective mass. Let us point out that because the X point is not an extremum, the 
slope of the dispersion is non-zero. The sign of the matrix elements {pzja is different 
for the band i = 1 and i = 2: p = {pz)ii = —{Pz)22- The values of A:^ are counted 
from the X point. 

In contrast to the 14 band model [7] or the recently developed 30 bands k-p 
model [14], our perturbation analysis allows to get excellent results with only two 
bands. This is because our approach is based on the perturbation theory around the 
X symmetry point, contrary to the perturbation methods [7, 14] developed around 
the r symmetry point which is far away from the conduction band minimum. 

By taking into account the diagonal elements (6.22) alone, we recover the com- 
monly used effective mass approximation for the conduction band (the linear term 
vanishes at the minimum k, = —ko). Indeed, by using the identity 



mo 



ko_ 

mi' 



(6.23) 



the diagonal elements (6.22) describe the two parabolic bands with the minima at the 
distance ±kQ symmetrically situated around the X point. The bands are degenerate 
at the X point provided the coupling between them is zero. 

Within the second-order perturbation theory the coupling between the bands is 
described by the off-diagonal terms: 



H^k) = 



7l Kxt^y 

M 



(6.24) 



The parameter M is evaluated from the k-p perturbation theory as [8]: 



1 



mi 



E 



iPx)u{Py)l2 



Ek{X)-EA,(X) 



Its value is close (but not equal) to 



1 

M 



1 

mt 



1 
mo' 



Assuming the coupling is small and taking into account that the decoupled 
bands are degenerate, we use degenerate perturbation theory to obtain the following 
dispersion relation close to the minimum at k^ = —ko'. 



£o(k) = -^ + 

2m, 

'Ih^kjck 



\ MA ) 



1/2 



(6.25) 
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Here Sk~ = k, + ko, A = 2hkop/mo is the gap between the Z\i and the A2' 
conduction bands at k, = —ko. 

In Fig. 6.1 this analytical expression (dotted contour lines) is compared to the 
numerical band structure obtained from the empirical pseudo-potential method 
(EPM) for k, = —ko- Excellent agreement is found up to an energy of 0.5 eV. Fig- 
ures 6.1-6.2 demonstrate that in [100] or [010] directions, when either kx or ky 
are zero, the dispersion remains parabolic. However, as soon as both kx and ky are 
non-zero, coupling between the bands appears which leads to non-parabolic correc- 
tion in the dispersion. The non-parabolicity is strongest along the [110] and [110] 
directions as anticipated in [17]. 

Let us now evaluate the value of the non-parabolicity parameter a in (6.21) based 
on the dispersion (6.25). Proceeding exactly as in [9], Appendix B, we arrive to the 
value a = 0.64 eV~\ which is in reasonable agreement with the experimental value 
a = 0.5eV-i. 

Finally, we would like to demonstrate that an underestimation of the conduction 
band warping in sp^d^s* tight-binding model [3] is related to the gap between the 
conduction bands at the minimum position ko. As indicated in Fig. 5.4, the gap 



0.15 



0.05 



eg" 




-0.05 



-0.05 0.05 

kx [2i:/ao] 



0.15 



Fig. 6.1 Comparison of the dispersion relation (6.25) at the valley minimum (dashed-dotted con- 
tour lines) with the EPM results (solid lines). The distance between the equi-energy contour lines 
is 50 meV 
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Fig. 6.2 Comparison of bulk dispersion relations close at the minimum of the [00 1 ] valleys of the 
conduction band in [100] and [110] directions. DFT [19], EPM [18], and k-p results are in good 
agreement. The sp^d^s* tight-binding model [3] slightly underestimates anisotropy 

between the two lowest conduction bands predicted by the sp^d^s* model [3] is 
unrealistically large. This results in a smaller coupling between the bands. The solid 
contour lines shown in Fig. 6.3 obtained from (6.25) with an unrealistic value of 
A = 1 .2 eV reproduce the results of the tight-binding model. This confirms the 
observation that the larger gap between the two bands at the valley minimum results 
in less anisotropy of the conduction band. 



6.3 Valence Band 

In the k-p theory of the valence band in silicon the three top valence bands are 
typically considered. These are the heavy hole, light hole, and the split-off valence 
bands, which will be numbered by the index n = 1,2,3. Without spin-orbit inter- 
action taken into account the three bands are degenerate at the F symmetry point. 
Therefore, the degenerate version of the k-p theory analogous to that for the con- 
duction band at the X point must be applied. The Bloch function at the point k is 
built as a linear combination of the Bloch amplitudes of the three valence bands at 
the r point k = 0: 



exp(/kr)M„k = exp(/kr) ^ C„^°'m„o 



(6.26) 
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Fig. 6.3 Dispersion at the valley minimum obtained from the sp^d^s* model (dashed contour 
lines), from (6.25) with the coiTect EPM value A = 0.53 eV (dotted-dashed contours), and from 
(6.25) with A = 1 .2 eV (solid lines). The distance between the equi-energy contour lines is 50 meV 



Considering the term Hi = (^/OTo)k ■ p as a perturbation, we obtain the following 



equation for the coefficients C„ 

3 



(0). 



E ^Kn'l'cck/i - £(2)3„„,]C„(«) = n = 1,2,3. (6.27) 



The equation has a non-trivial solution only if 

det\Df„,kc,kp-El^^8„„'\ = 0. 
Here 



D"^, 



h v^ (Mnol Pa Wlo) (mzoI Pfi l"«'0> 



E 



'Oz^« 



EnO — Eio 



(6.28) 



(6.29) 



where the summation over remote bands / 7^ n is performed. The Greek indeces a 
and P = 1,2,3 stand for the coordinates x, y,z in the principal coordinate system. 



(«, 


10 


\Px |/o) {lo\Px 


|mio) 






E„o — Eio 




(mj 


10 


\Py\lo) {lo\Py 


|mio> 
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In cubic crystals D^^, can be parameterized by the three parameters usually called 
L,M,N: 

L = — > ' '" ' ' ' '" ' ' , (6.30) 

j^j ^ ^^ Y^ ^"101 Py I'o; \'oi Py |Mio; 
Wo "Y ^"° - -^/o ' 

j^ ^ ^Y^ (t'lol ;^x Ko) (^ol Py |M2o) + (mioI Py \lo) {lo\ Px \U2o) 

mo ^ End - Elo 

Using these parameters the k-p Hamiltonian is written in the form: 

/^^+Lkl + M(kl + kl) Nk,k, Nk,k, \ 

//3X3= Nk,k, ^^+Lkl + M{kl + kl) Nkyk, 

V Nk,k, Nk,k, £-^+Lk^ + M(kl + kl)) 

(6.33) 

The symmetry of the three Bloch amplitudes m„o in diamond structures is of 
p-type [20]. This means that these wave functions are made by the p-type atomic 
orbitals with the eigenvalue L = 1 of the angular momentum operator L. The 
three Bloch wave functions m„o at the F symmetry point are called in the next as 
ki) , \e-i) and jes). Their symmetry properties are used to construct the spin-orbit 
Hamiltonian discussed in the next section. 



63.1 Spin-Orbit Coupling in the Valence Band 

The spin-orbit interaction is a relativistic effect important for carriers with high 
velocities. Since the conduction band states are constructed from the outer atomic 
orbitals where electron velocities are not large, the spin-orbit interaction is usually 
neglected for the conducting electrons. In contrast, the valence band is made out of 
internal orbitals positioned closer to the nucleus and thus characterized by relatively 
high carriers velocity. In order to describe the valence band correctly, the spin-orbit 
interaction must be taken into account properly. 

The Hamiltonian of spin-orbit interaction is usually expressed as [20] 

Hso = ALS, (6.34) 

where S is the spin and L is the angular momentum operator, and A is the constant 
of the spin-orbit coupling. Defining the operator of the total angular moment J = 
L + S, one can demonstrate that 

J2 = (L + Sf = L^ + S^ + 2L ■ S, 
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from where we find the operator L • S as 

L • S = - (j2 - L^ - S^) (6.35) 

The eigenfunctions of the spin-orbit operator are the eigenfunctions of L^, 5*^, 
and the total angular momentum square /^. These functions are different from the 
wave functions | e, ) , which are the eigenfunctions of the orbital angular momentum. 

One can extend the basis set |e,) by including the spin degree of freedom 
a =t' I- The Hamiltonian (6.33) written in the basis |ey,CT) including spin is 
block-diagonal: 

i^6x6=f^r„° V (6.36) 



However, the functions of the extended set |e, , c) are the eigenfunctions of the 
L^, L,, S^, and S, operators. These functions are related to the eigenfunctions 
|/, /,) of L^, S^, the total angular momentum square J^, and its projection /, 
by a linear transformation. In our case s = 1/2, / = 1, and the quantum number 
of the total angular momentum takes two values: j = I + s and j = I — s. The 
eigenvalues of the projection operator X are equal to j^ = —j, —j + I, ..., j — I, j . 
The eigenvalue of the operator of spin-orbit interaction in the basis | /, /,) are: 

AL ■ S |/, /,) = ^[JU + 1) - /(/ + 1) - s{s + 1)] |/, /,) . (6.37) 

Thus the Hamiltonian of the spin-orbit interaction is diagonal, with the elements 

2'-^y) = -^%' (6.38) 

where the value of the spin-orbit splitting Aso = 3A was introduced. The values 
of the spin-orbit splitting for silicon, germanium, and several III-V semiconductors 
are collected in Table 6.1. 

The coefficients of the linear transformation from the basis I ey , a) to the basis 
|/, /,) are the Clebsch and Gordon coefficients with s = 1/2 and 1 = 1. Thus the 
transformation is in the form [12]: 



Table 6.1 Spin-orbit 
splitting parameters [20] 



P' 


Hso 


\.. 


Hso 



Semiconductor 


A,o(eV) 


Si 


0.044 


Ge 


0.295 


GaAs 


0.341 


InP 


0.11 


InAs 


0.38 
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= -j=Qe,,^)-i\e2A)+2\e,,i)) 



2' 21 



^i\ei,i)-i |e2,i)), 

^{\ei,i) + i\e2,i) + \e3Ay)- 

^(\eiA)-i\e2A) k3;». 



(6.39) 



This allows to write the Hamiltonian of the spin-orbit interaction in the basis 
k,-,a)[12]: 

ki,t> \eui) k2,t> 1^2,;) k3,t> k3,;> 



Hsn = — - 



A,, 



eiA) 


( ° 


i 











-1 \ 




eia> 


—i 














i 




e2A) 











1 


—i 







e2,i) 








1 





—i 







e^A) 








i 


/ 










e3,;> 


I -1 


—i 











) 
















(6.40) 



There exist different conventions for the phase factors in defining the coeffi- 
cients in \J , J,) as compared to (6.39). The convention for the phase factors used by 
Luttinger and Kohn [10, 1 1] and Rodriguez [15] and Chao [5, 6] relative to [12] are 
summarized in Table 6.2. Different phase factors do not of course affect the physical 
results. 

The inverse transformation expressing |ey, o) via |/, /;) is: 



ki,t> = 



4> = 



k2,t> = 



1 

7! 

1 

7^ 



- 


3 

2'" 


rv5 


3 

2'" 


4)-/! 


1 

x/3 


3 1\ jl 
2' 2/" V 3 


2'2) + 


3 
2 


3 
2 


1) 


1 

"V3 


3 
2 


1 


\ /2 

rv3 


1 

2 




Table 6.2 The convention for the 
Rodriguez [15] and Chao [5,6] 


phase factors used by Luttinger and 


Kohn [10, 11] and 


Wave function 


3 3\ 

7 ■ 71 


7' 7] 


l^) 


h^) 


i^) 


i.^> 



Luttinger-Kohn 
Rodriguez-Chao 
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\e2,i) 
k3,t> 

k3,;> 






3 1 

2' 2 



+ 



3 1 

2'2 
3 1 
2' 



1 

7! 



1 



2/^ V3 



1 1 

2' 2 
1 



1 1 

2' 2 



+ 



3 
2'' 



(6.41) 



The total k • p Hamiltonian including spin-orbit interaction is usually written in 
the basis |/, /j.) [12]: 



H 



Here 



3 3\ 
2' 2/ 

3 i\ 

2' 2/ 
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3 _ 
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2' 2/ 

1 _ 

2' 



I- -\ 
I 2' 2/ 
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I- -) 
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P-Q 



-V2Q 



I 2' 2/ 
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V2i?t yist V2e 
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P + Q 
—JlR 

V2 



/'(k) 



(L + 2M) + 



3^ 
2mo 
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e(k)=-[(L-M)(^2^fc2_2^2^-] 



S(k) 
«(k) 



-N 

-1 

2V3 



I- -) 

I 2' 2/ 

V2 



(Kjc iky)k^ 



{{M-L)(kl-ey) + i2Nk^ky\ 



li _i\ 

l2' 2/ 

V2i? \ 
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Vi^ 


/let 
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/' + Z\ 








P + A 



(6.42) 

(6.43) 
(6.44) 
(6.45) 
(6.46) 



6.3.2 Dispersion of the Valence Band in Silicon 



Because the point symmetry group of the diamond lattice contains inversion, the 
Hamiltonian (6.42) is doubly degenerate, and the dispersion relations do not depend 
on the spin projection. The Hamiltonian (6.42) can be presented in the form [12]: 



"^ra 



(6.47) 



76 



6 Perturbative Methods for Band Structure Calculations in Silicon 



-20 - 



-40 



-60 



-80 



-100 



-120 



' ^^^^^" ' '^''"^ 


;,„-^ —''■ ^^ 




^s.\. 


• 


HH 




/ 
/ 

/ 
/ 


LH 




/ 


♦ - SO 




/ 




\ \ 


• 




\ \ 


/ 




\ \ 


/ \ ^ 




> % 


♦ ♦^ - 




t ♦ 


♦ ♦v " 


* 


- ♦ V 


[110] [100] 




' 1 1 1 1 1 1 1 1 1 



-0.04 



-0.02 





k [271 /ao] 



0.02 



0.04 



Fig. 6.4 Energy dispersions of the heavy hole, light hole, and split-off bands in unstraned silicon 
along [110] and [100] directions. The heavy hole and the light hole bands are degenerate at the 
r -point 



where the 3x3 matrices B^ reveal the valence band spectrum. The dispersion 
relations can be found analytically [12], however, they are quite cumbersome, and 
we address the reader to the original paper [12]. 

Dispersion of the heavy hole (HH), light hole (LH), and split-off (SO) bands in 
the [100] and [110] directions are shown in Fig. 6.4. The HH and LH bands are 
degenerate at the F point, while due to the spin-orbit coupling the energy of the 
maximum of the split-off band is 44 me V lower. The surface plots of energy disper- 
sions are shown in Fig. 6.5, Fig. 6.6, and Fig. 6.7, for HH, LH, and split-off bands, 
respectively. The figures demonstrate that the energy dispersions are not isotropic. 
In order to produce the plots we have used the parameters listed in Table 6.3. 



6.3.3 Luttinger Parameters 



In case when the spin-orbit coupling Aso is large, the coupling between the split- 
off band and the heavy hole and light hole bands can be ignored, and a simpler 
analytical dispersion for the HH and LH bands can be obtained. In this case the dis- 
persion of the HH and LH bands is determined by the following 4x4 Hamiltonian 
conveniently written as [11]: 
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Fig. 6.5 Energy dispersion of tlie lieavy Irole band in relaxed silicon 
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Fig. 6.6 Energy dispersion of the light hole band in relaxed silicon 
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Fig. 6.7 Energy dispersion of the split-off hole band in silicon 



Table 6.3 The parameters 
used for the silicon valence 
band 



Parameter 


Value 


Units 


L 


-6.53 


_*1 

2mo 


M 


-4.64 


_*1 

2mn 


N 


-8.75 




Aso 


44 


meV 



H 



h^ 



(yi + ^)fc' - 2Y2{klJ^, + kp^ + klJ^) 



2mo 

-4Y3i{k^ky}{J^Jy} + {kyk,}{jyj,} + {k ,k ^} { J ,J ^}) 



Here {} stands for an anti-commutator: 



1 



{JaJp} = -(JaJfi + JfiJa), 



(6.48) 



(6.49) 



and Jx , Jy and /, are the projection operators of the total angular momentum 
operator J. They satisfy the standard commutation relations: 



[Jx^^J — JxJy J yf X — ^ J z^ 



etc. 



(6.50) 



With the split-off band decoupled, the HH and LH bands correspond to 7 = 3/2. 
In the basis ||, /,), these momentum projection operators are represented by the 
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following matrices: 



Jx = 



/ V3 \ 
-V3 1 
-1 V3 
V -V3 ) 



Jv = 



79 



^0^/3 ^ 
V3 1 
1 V3 

V V3 y 



/. = 



/30 \ 
1 10 
00-1 
VoO -3/ 

Introducing the following dimensionless Luttinger parameters: 

Ki =-—^(L + 2M)-\, 



Y2 = 
Y3 = 



3^2 

mo 
'3h 

'3^ 



,iL-M), 



N, 



(6.51) 



(6.52) 



the dispersion relations for the HH band (£+) and LH band (£-) are conveniently 
written in the form [4] : 

^^^^^ " ^'S^ ^ S^'"'^' + ^^^' " K2)(^'^' + k^yk^ + klkl)YI\ (6.53) 
Only in case y2 = ys the dispersions (6.53) are parabolic: 



Table 6.4 The Luttinger parameters for several semiconductors with zincblende and diamond 
lattice structures [1, 13] 





Yi 


Y2 


73 


Si 


4.22 


0.39 


1.44 


Ge 


13.35 


4.25 


5.69 


GaP 


4.2 


0.98 


1.66 


GaAs 


7.65 


2.41 


3.28 


GaSb 


11.8 


4.03 


5.26 


InP 


6.28 


2.08 


2.76 


InAs 


19.67 


8.37 


9.29 


InSb 


35.08 


15.64 


16.91 


ZnS 


2.54 


0.75 


1.09 


ZnSe 


3.77 


1.24 


1.67 


ZnTe 


3.74 


1.07 


1.64 


CdTe 


5.29 


1.89 


2.46 
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E±{k) = ^{y,±2Y2)k\ (6.54) 

In a general situation the square root in (6.53) remains, and the HH and LH band 
dispersions can not be described by the effective masses. The values of the Luttinger 
parameters Yi,Y2, Ys for several semiconductors are summarized in Table 6.4. 

We stress again that the separation of the total 6x6 Hamiltonian into the 4x4 
Hamiltonian for the HH and LH bands and the 2x2 Hamiltonian for the split- 
off band is possible only, when the spin-orbit splitting Aso is much larger than the 
coupling terms between the states 1 1, /;) and | ^, /;) in the Hamiltonian (6.42). In 
silicon, however, the spin-orbit splitting is only 44meV, which is roughly IksT, 
and a more general approach of Manku [12] to diagonalize the full Hamiltonian 
(6.42) must be applied. 
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Chapter 7 

Strain Effects on the Silicon Crystal Structure 



7.1 Strain-Induced Symmetry Reduction of Silicon 
Crystal Lattice 

When homogeneous strain is apphed to a crystal, the Bravais lattice basis vectors 
a,' transforms as 

a/ = (! + £) -a,-, (7.1) 

where s is the strain tensor [1] and 1 is the unit matrix. The application of strain 
modifies the volume ^q of the primitive unit cell: 

i^O = ■^o(l + Sxx + Syy + Szz), (7-2) 

Thus, the volume of the new primitive cell i?Q changes, when the hydrostatic strain 
component, Sxx + £yy + £zz^ is non-zero. 

This is not the only modification of the crystal lattice, however. Depending on 
strain conditions, the type of the Bravais lattice may change, and the symmetry of 
the crystal lattice may be altered. The set of the symmetry transformation under 
which the crystal lattice remains invariant is in general modified by applying stress. 

Relaxed silicon belongs to Oh symmetry group. The point symmetry group Oh 
possesses the highest symmetry from all the groups of crystal lattices. 



7.1.1 Oh Symmetry 

The point group Oh contains 48 symmetry elements: 

• The unity operation, 

• Clockwise rotation of 180° about the principal axes e, (three operations), 

• Clockwise and counter-clockwise rotation of 120° about the four space diagonals 
(eight operations), 

• Clockwise rotation of 180° about the axes e^, representing the diagonals on the 
faces of a unit cube (six operations). 
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Table 7.1 Point group and symmetry elements of strained lattices that originate, when stress is 
applied along various high symmetry directions to an initially cubic lattice Oh- The Schonflies 
symbols are used to specify the point group. \P(r)\ denotes the number of elements of the point 
group 



Point group 


Symmetry elements 




\p(n\ 


Stress direction 


Oh 


E 2, 3+ 37 2, 4+ 4r I 2, 3+ 37 


7 4.+ A— 


48 


Relaxed 


Dm, 


E 4+ 47 2,- 2, I 4+ 47 2,- 2, 




16 


Stress along (100) 


Di,i 


E 3+ 37 2,'. I 3+ 37 2,// 




12 


Stress along (111) 


Dih 


E 2, I 2, 




8 


Stress along (110) 


C2h 


E 2- 1 2, 




4 


Stress along (120) 


Si 


EI 




2 


Other directions 



• Clockwise and counter-clockwise rotations of 90° about the principal axes e, (six 
operations). 

Since the symmetry is preserved for any of these 24 operations followed by the 
inversion operation, there are 48 symmetry operations in total. 

The symmetry of diamond structure is lowered by distortion of the crystal. 
Because stress applied to a diamond structure reduces the symmetry, the point sym- 
metry group of the strained crystal is a subgroup of C/,: it contains only those 
symmetry elements which are preserved under strain. The effect of a homogeneous 
strain on the symmetry of the Bravais lattice depends on the direction along which 
stress is applied. 

There exist two different paths to lower the symmetry of the point group Oh to 
the point group ^2 [1]: 

Oh -^ D^h -^ D2h -^ C2h -^ S2 (7.3) 

Oh -^ D^d -^ C2h -^ S2. CIA) 

In Table 7.1 five directions of uniaxial stress are given that yield a symmetry 
reduction to the five point subgroups D^h, D^d^ D2h, C2h, S2 of the group Oh- 



7.1.2 D^h Symmetry 

When the silicon lattice is stressed in the (001 ) direction, or along a fourfold axis e, , 
the cubic lattice of symmetry class Oh becomes a square cuboid (rectangular par- 
allelepiped on a square base), representing the Bravais parallelepiped belonging to 
class Dnh [1]- A similar symmetry reduction is observed, if biaxial strain is applied 
in a {001} plane. The D^h class is a member of the tetragonal crystal system. The 
point group D^h has 16 symmetry elements listed in Table 7.1. Note that there are 
no threefold symmetry axes, and only one fourfold symmetry axis remains. 
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The strain tensor which yields the symmetry reduction Oh -^ D^h is in the form 
£(001) from (3.34) or e[ooi] from (3.36). It has non-zero elements in the diagonal 
(e.g. £ii = £22 7^ £33), while all off-diagonal elements are zero. 



7.1.3 Dm Symmetry 

If uniaxial stress is applied along the (111) directions, or parallel to the cube diago- 
nals corresponding to the threefold symmetry axes e^ the cubic face-centered lattice 
transforms to a primitive rhombohedral lattice belonging to the crystal class Dj,d of 
the trigonal (= rhombohedral) system [1]. 

The symmetry class D^d includes, apart from the trivial unity transformation and 
inversion, the threefold rotations about the direction of stress and twofold rotations 
around three axes perpendicular to the threefold axis, resulting in twelve symmetry 
operations. 

The strain tensor resulting from stress along (111), which yields a O/, — ^ Dj,d 
symmetry reduction, is in the form £(111) from (3.34) or £[111] from (3.36). It 
contains equal off-diagonal components £12 = £13 = £23 and equal diagonal 
components en = £22 = £33- Because for biaxial strain in {111} plane the strain 
tensor has got a similar form, the same symmetry reduction by straining the crystal 
biaxially in the {111} plane is achieved. 



7.1.4 D21, Symmetry 

The Bravais lattice of the crystal class Oh is converted to a parallelepiped of the 
orthorhombic system belonging to Djh in two ways [1,7]: 

1 . Dilatation or compression of different strength along two of the three fourfold 
axes e, . This results in a parallelepiped with rectangular faces (cuboid). Of the 
five twofold axes e, and e^ of D^h , only the three e, along the edges of the 
parallelepiped remain. This symmetry reduction can be achieved by applying 
stress of different magnitude along two of the three equivalent (100) directions, 
simultaneously. In this case, the strain tensor is given by £11 7^ £22 7^ £33 and 
contains vanishing off-diagonal components. 

2. The deformation originates from shearing the unit cube, thus altering the angles 
between the basis vectors. The result is a rectangular parallelepiped with rhombic 
base, which is also invariant under D2h ■ Of the original five twofold axes e, and 
e^ only two (diagonals of the base) remain. This type of lattice results, when 
uniaxial stress is applied along (110) or from biaxial strain in a { 1 10} plane. The 
strain tensor has the form 

^£11 £12 \ 
£= |£i2£ii , (7.5) 

£33/ 
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where the components of the strain tensor are related to stress according to (3.36). 
This group has only eight symmetry elements given in Table 7.1. 



7.1.5 Cih Symmetry 

Higher symmetry reduction results from deforming the base of the Bravais paral- 
lelepiped of the orthorhombic system so that the angle between its edges is changed. 
In this way the invariant parallelepiped of the system C2h is obtained from the cubic 
lattice Oh [1]. It contains four symmetry operations given in Table 7.1 with only 
one twofold symmetry axis. 

Uniaxial stress in [120] direction can achieve this kind of symmetry reduction. 
The strain tensor has three different non-zero diagonal components and one off- 
diagonal component 

hn £12 \ 

e = I £12 £22 . (7.6) 

£33/ 



7.2 Internal Strain Parameter 

The strain tensor (3.4) determines only the deformation of the primitive cell as a 
whole. In strained lattice, the displacements of the basis atoms relative to their posi- 
tions in the relaxed lattice appears. This internal displacement has no impact on the 
lattice symmetry, however, it affects the band dispersions [3]. The internal displace- 
ment is not described by the strain tensor, and the information about the positions 
of the atoms must be obtained from the lattice structure calculations in a strained 
crystal. 

The displacement of the atoms due to stress in each fee sublattice is determined 
by the strain tensor, however, the relative displacement of the sublattices determined 
by the position of the second basis atom in the primitive cell of the strained lattice 
remains undefined. To obtain the exact position of the central atom an additional 
parameter for this displacement has to be introduced. 

To define this internal strain parameter let us consider the change of atomic posi- 
tions in the primitive unit cell under strain shown in Fig. 7.1. In the relaxed lattice, 
the central atom is situated exactly in the center of the tetrahedron. In the stressed 
lattice the vertex atoms move into their new positions defined by the strain tensor. 
If the central atom is assumed to move into its position determined by the strain 
tensor, it may get displaced from the center of the distorted tetrahedron formed by 
the four vertex atoms as shown in Fig. 7.1. However, in this case the central atom 
will be closer to some of the vertex atoms of the strained tetrahedron than to others, 
and the total energy of the system will rise. In order to minimize the lattice energy, 
the central atom will be displaced towards the center of the deformed tetrahedron 
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Fig. 7.1 Unit cell of strained diamond stmcture with the central atom being at: (a) The position 
"one" defined by the strain tensor; (b) The position "two" with equal bonds to the four vertex 
atoms; (c) The real position determined by the internal strain parameter J 



defined as the point with equal distances to each of the four vertex atoms. How- 
ever, because of an increase of the energy due to non-central forces between the 
surrounding atoms [4], the central atom does not completely relax to the center of 
the strained tetrahedron as demonstrated in Fig. 7.1. 

For general stress conditions, this additional displacement of the central atom, or 
the displacement of one of the inter-penetrating fee lattice with respect to another, 
is described by the displacement factor called the internal strain parameter ^. To 
define it, let us first determine the positions of the four vertex atoms of the tetrahe- 
dron defined by the strain tensor. Next, let us determine the position of the central 
atom in the deformed tetrahedron, called the position "one", by assuming that there 
was no additional displacements between the two sublattices, and the positions of 
atoms in the second sublattice are also determined by the strain tensor. Second, let 
us find the center of the deformed tetrahedron as the point situated at equal dis- 
tances from the four vertex atoms and call it the position "two". The actual position 
of the central atom in the strained lattice does not coincide with either of the two 
limiting positions and will be located on a line between the two defined positions. 
The internal strain parameter ^ denotes the fraction of the distance between these 
two specified positions at which the central atom is actually located. The internal 
strain parameter ^ is set to zero, if the central atom remains at the position "one" 
determined by the strain tensor, and §^ = 1 if the central atom moves to the position 
"two" characterized by all four bonds between the atom and the vertexes being of 
the same length. 

In a real crystal the value of the internal strain parameter < ^ < 1. The 
value of the internal strain parameter ^ = 0.54 ± 0.04 was extracted from the 
experiment [2]. The experimental value coincides with the results of theoretical 
calculations [5], which give ^ = 0.53. Recently, advanced theoretical calculations 
based on the first-principle ab-initio program VASP (Vienna ab-initio simulation 
program) [10] yield the value ^ = 0.5 [9], which is very close to the previously 
obtained values. 

The position of the central atom can be easily determined for practically relevant 
stress conditions. Because in case of uniaxial stress applied along (100) direction 
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the center of the deformed primitive unit cell, the position "two", coincides with the 
position "one" defined by the strain tensor, no internal displacement occurs. 

For stress in [110] direction the additional displacement along [001] axis occurs. 
The value of this additional displacement is [8] 

HI + Sxx)exy ,_ „, 

Mj: = -X 7-, ao- (7.7) 

2 1 + 8,. 

If now uniaxial stress is along [1 1 1] direction, the additional displacement is in 
the same [111] direction, with the value equal to 



■^-E^yaoihhl). (7.8) 



7.3 Strain and Symmetry of the Brillouin Zone 

The Brillouin zone of the strained crystal can be obtained by an appropriate defor- 
mation of the Brillouin zone of the relaxed crystal [ 1 ] . Because the symmetry of the 
strained crystal is reduced, the Brillouin zone of the strained crystal possesses less 
symmetry. 

A possible choice for the irreducible wedge of relaxed Si is depicted in Fig. 7.2. 
It should be noted that this specific choice is not unique, and other shapes for the 
irreducible wedge, reflecting the symmetries of Oh, can be found [6]. The volume 
of the specified irreducible wedge is i2Bz/48 in accordance with the relation (4.7). 

From (4.7) it can be concluded that the higher the point symmetry of the crys- 
tal lattice, the smaller is the volume of the irreducible wedge. The symmetries of 
the Brillouin zone and the size of the irreducible wedges for the crystal systems 
O^h , Djd . D2h , Cjh, and S2 are briefly discussed below. 

The symmetry operations of the D^h class enforce the following invariance of 
the energy dispersions under reflections 

EAk^,ky,kJ=E„i\k^\,\ky\,\kS. (7.9) 

Therefore, in order to obtain the dispersion relations in the whole Brillouin zone, it 
is enough to compute them only for the first octant. The additional symmetry 

E„{k^,ky,k,) = En(k^,k„ky) (7.10) 

for stress along [100] direction allows additionally reducing the volume of 
irreducible wedge by a factor of two. The irreducible wedge is shown in 
Fig. 7.2. 

The invariants of the energy band structure for the Dj,d class are listed in 
Table 7.2. In this table the Miller index notation of the cubic lattice is used to specify 
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Stress along 
[100] 




Fig. 7.2 Irreducible wedges for relaxed silicon and stresses along [100], [110], and [111] 
directions 



Table 7.2 Symmetry operations leaving the band structure invariant for the D^,i class. Directions 
of the rotation axes are given in Miller index notation for the cubic lattice 



Symmetry operation 



Energy 



Symmetry operation 



Energy 



E 

3[iii] 

■^[111] 

2[Iio] 

2[lui] 



E„ik,:k,.^k-J 
E„{k,,k,,k,) 




I 
3[iii] 


E„{ky,k„k,) 




3+ 
■^[111] 


E„(-ky-k,,. 
E„(-k,.-k,.. 


-K) 


2[Tio] 
2[Toi] 


E„{-k,.-k„- 


-k,) 


2loTil 



^nV f^x^ '^v 


-fc.) 


E„(-k,.-k„- 


-k,) 


E„(-ky,-k„- 


-k,) 


-C-n yf^yi K.x-> f^z) 




£Lfi\IC^, ky, fCx) 




E„(k,,k„ky) 





tlie directions of the rotation axes. The twelve involved symmetry operations give 
rise to a volume of the irreducible wedge of f?Bz/12 shown in Fig. 7.2. 

The D2h class contains eight symmetry elements, thus the volume of the irre- 
ducible wedge is i^Bz/S. When non-equal stress is applied along two of the three 
fourfold axes e, , any octant of the Brillouin zone can be chosen as the irreducible 
wedge. Certain care must be taken, when choosing the wedge in presence of uniaxial 
stress along [110] direction [7]. The irreducible wedge is depicted in Fig. 7.2. 

For stress in [120] direction the symmetry class of the lattice reduces to C2/,, and 
the volume of the irreducible wedge is .Qbz/4. Finally, for stress along directions 
different from those listed in Table 7.1 the crystal loses all rotational symmetries 
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and is invariant only under inversion. The resulting crystal class is ^2, and half of 
the first Brillouin zone must be chosen as the irreducible wedge for band structure 
calculation. 
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Chapter 8 

Strain Effects on the Silicon Band Structure 



8.1 Linear Deformation Potential Theory 
8.1.1 Conduction Band 

The deformation potential theory to describe the influence of strain on the band 
structure was developed by Bardeen and Shockley [2] and later generalized by 
Herring and Vogt [7]. Within this theory the energy is represented as a Taylor series 
in powers of lattice strain, and the expansion is truncated after the terms linear in 
strain. The theory thus relates the shifts of the energy bands to small deformations 
of the crystal as: 

AE{^) = Y,Df£ji. (8.1) 

y 
It follows from (8.1) that the band shift is linear in strain. The coefficients of propor- 
tionality D]: form a second rank tensor. This tensor called the deformation poten- 
tial tensor is a characteristic of a given non-degenerate band at a chosen point k. 
The tensor is symmetric and therefore has only six independent components. In 
cubic semiconductors the number of components is further reduced to three [7]. 

Based on the linear deformation potential theory, the shift of the conduction band 
minima with stress in silicon and germanium can be evaluated. The shift depends on 
the magnitude of forces applied and their directions with respect to the valley ori- 
entations. For arbitrary stress conditions, the degenerate minima in silicon are split. 
The value of the valley splitting, which is linear in strain within the linear defor- 
mation potential theory [7] is completely determined by the only two deformation 
potentials Z)^ and Z)„ [1]. The general form of the linear energy shift (8.1) for one 
of the six degenerate valleys / = 1 , 2, .., 6 in silicon for an arbitrary homogeneous 
deformation can be written in the following form 

Zi£« = Dj Tr(£) + £),f af £3/ , (8.2) 

where a, is a unit vector parallel to the Ko vector determining the minimum posi- 
tion of the valley / . It follows from (8.2) that the shift of the mean energy of the 
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conduction band depends on the hydrostatic pressure: 



AE 



(k) _ 



(Di + Id:, 



)7>(e). 



(8.3) 



Because the deformation potentials Z)j and Z)„ have different values at different 
locations in the Brillouin zone, the average energy shifts are different for different 
valley types. This difference has to be taken into account, when more than one 
type of valleys is considered, since the relative shift of the mean energy causes a 
repopulation of carriers between these types of valleys. For example, the energy 
shifts of the conduction valleys at the L-point is determined by the same expression 
(8.2), with the deformation potentials D^ and Dj;' computed at the L-point [1]. 
However, if the valley were at the F symmetry point, it is enough to know a single 
deformation potential to determine the valley shift: 



SE^ = D^Tr(E). 



(8.4) 



Using the above relations the valley splitting due to stress along any direction can be 
obtained once the strain tensor is known. Depending on stress conditions, the energy 
shifts of different valleys can be different. Uniaxial [001] stress or biaxial strain in 
(00 1 ) plane can be used to partly remove the degeneracy of the six equivalent valleys 
in silicon as demonstrated in Fig. 8.1. 




Fig. 8.1 Uniaxial [001] stress or biaxial strain in (001) plane partly lifts the degeneracy of the six 
equivalent valleys in the conduction band of silicon by shifting the k, valleys "3" relative to the k^ 
and ky valleys. The valleys "1" and "2" remain degenerate 
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Table 8.1 Strain-induced energy shifts of the conduction band valleys of cubic semiconductors, 
when uniaxial stress is applied along a given direction. It is assumed that the bands are not degen- 
erate at the position of their minima, although the equivalent valleys located at different points in 
the Brillouin zone can be degenerate with each other 



Stress direction 


Valley 


Valley direction 


&E/P 


[100] 


A 


[100] 


DJ{sn+2su) + D^sn 




A 


[010][001] 


D^(sn+2sn) + D^sn 




L 


[111][111][111][111] 


D]j(sn+2s,^) + D\;/^(sn+2sn) 




r 


[000] 


D^isn +2sn) 


[110] 
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[100] [010] 


D^(sn+2si2) + D^/2(si,+sn) 




A 


[001] 


D^{sn+2st2) + D^sn 




L 


[111][111] 


D]j(sn + 2sn) + A';/3(^ii + 2sn + su) 




L 


[Ill][lll] 


D]l(sn + 2sn) + D^-J^sn + 2sn-su) 
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[000] 


D^,(su+2sn) 


[111] 


A 


[100] [010] [001] 


D^(su+2sn) + D^/i{sn+2sn) 




L 


[111] 


Dllisn + 2sn) + D^J^sn + 2sn + 2s,,) 




L 


[111][111][111] 


Dliisn + 2sn) + D];/3(sn + 2sn - 2/3su) 




r 


[000] 


Di;{sn+2sn) 



The analytical expressions for the energy shifts of the conduction band valleys 
for three stress directions [100], [110], and [111] are summarized in Table 8.1 [14], 



8.1.2 Valence Band 

In case of bands degenerate at a certain symmetry point of the Brillouin zone strain 
causes not only relative shifts between the bands but also modifies their shapes. In 
order to describe the band warping, an additional Hamiltonian linear in strain is 
added to a k-p Hamiltonian of the relaxed crystal. The most general form of this 
additional strain Hamiltonian is given by its matrix element 

SH{sU^, = J2 Km'^ij' (8-5) 

'J 

where m , m' denote the corresponding components of the k-p Hamiltonian, and Z)y 
is the deformation potential operator which transforms under symmetry operations 
as a second rank tensor [8]. From the symmetry of the strain tensor with respect to 
/, j one concludes that Djj = Dji, therefore maximum six independent deformation 
potential operators exist. In cubic semiconductors the minima of the conduction and 
the valence bands are situated at the symmetry lines of the Brillouin zone which 
further reduces the number of independent constants determining the deformation 
potential operators. Due to the symmetry of the states at these points the deforma- 
tion potential operators of a particular state can be described in terms of two or three 
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deformation potential constants [7, 12]. The values of the deformation potentials are 
usually determined using electrical, optical, microwave techniques or by analyz- 
ing stress-induced indirect absorption edges. The deformation potential constants 
can also be computed numerically by comparing the band structure of strained and 
relaxed systems. The values of deformation potentials for silicon and germanium 
obtained by different methods are summarized in [5] . The formal expressions for the 
deformation potentials obtained from the perturbative expansion of the Hamiltonian 
of the strained crystal for the deformation potentials will be obtained in Sect. 8.2 
below. 



8.1.3 Stress-Induced Band Splitting of the Valence Bands 

Strain induced splitting and warping of the bands can be calculated within the 
k- p model after adding the strain-dependent perturbation Hamiltonian SH . Within 
the deformation potential theory the energy dispersions of the valence bands are 
obtained from a perturbation Hamiltonian SH(s) added to the k-p Hamiltonian 

(6.33): 

(Isxx + m(syy + £,,) nsxy ns,x 

nsxy Isyy +m(s,. + Sxx) nsyx 

nSzx nSyz l£zz + m{£xx + Syy)/ ^'^^'^ 

(8.6) 

In the basis |e, , c), where a ='[, l-, the matrix of the perturbation to the valence 
band Hamiltonian is 

_/SH(e) 03x3 Vt) .8 7^ 

V 03x3 Stl(e)J \i) 

where SH is determined by (8.6). The three parameters l,m,n denote valence band 
deformation potentials. The effect of spin-orbit coupling is taken into account by 
introducing a spin-orbit interaction term, Hso- The total 6x6 Hamiltonian of the 
valence band in strained silicon in the basis |e,, o) is the sum of (6.36), (8.7) and 
(6.40): 

H(e) = H + m,,™„ + H,,. (8.8) 

The Hamiltonian (8.8) can be diagonalized by performing a suitable unitary 
transformation. After some mathematical manipulations, Manku [9] has arrived at 
the following form describing the valence band of strained silicon: 

h'nh22h'33 + 2h\2h'23h'i3 - Ki^l^ - Kih^^ - h'^^h^^ 
-^ {h'nh'22 + h'uh',, + h'22h',, - h\2 - h\^ - hi,) = 0. (8.9) 

Here the /z,y and the h'- are determined by the matrices 7^3x3 SH{s) defined by 
(6.33) and (8.6), respectively, through the relation 
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hij = Hij + SH,j(s) h',=hu+- E(k), (8.10) 

where £(k) is the energy to be determined. Equation (8.10) can be simplified to [9] 

3 3-/ 

^^ay£(k)^yc2' =0, (8.11) 

;=oy=o 

which is a cubic equation in £(k). Its solutions give the energies for the HH, LH and 
the split-off bands for a particular value of k and arbitrary strain. The components aij 
depend on the spherical angles determining the direction k in the crystallographic 
system and are functions of the strain tensor. The strain-induced splitting of the 
valence bands can be obtained by setting k = in (8.1 1) to give 



J^aiE' =0, (8.12) 



;=0 

where the coefficients a, are defined as [9] 

■50 / , , 2 2 

flo = —^{pq + pr + qr -n Sj. 

+ n^(2nSxySy,Sxz - pel. - qel, - re^y) + pqr (8.13) 

(21 = —^{p + q + r) + {pq + pr + qr - n^s^) (8.14) 

a2 = p+q + r-Aso (8.15) 

<33 = -1 (8.16) 

4=^ly+4z + ^lz (8.17) 



and 




(8.18) 



Modification of the valence band due to uniaxial or biaxial stress is well under- 
stood within the six-band k-p model [9, 13, 17]. Here we show some results for the 
technologically relevant compressive stress along [110] direction. The values of the 
deformation potentials l,m,n used are listed in Table 8.2. 



Table 8.2 The valence band parameters used 



Parameter 


Value 


Units 


/ 


-2.44 


eV 


m 


4.37 


eV 


n 


-6.8 


eV 
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Fig. 8.2 Energy dispersions of the heavy hole, light hole, and split-off hands along [110] and 
[100] directions in compressively stressed silicon. Stress of 1 GPa is applied in [110] directions. 
The degeneracy between the heavy hole and light hole bands at the F -point is lifted 



Figure 8.2 displays the dispersion of the HH, LH, and split-off valence bands, 
when compressive stress of 1 GPa along the [1 10] direction is applied. A clear split- 
ting between the bands appears. The dependence of the band splitting on the value 
of the compressive stress along [110] direction is shown in Fig. 8.3. At high stress 
values only the HH band is occupied and thus contributing to transport. 

Apart from splitting, a substantial valence band warping appears in strained sili- 
con. Figure 8.4 shows the in-plane dispersion of the HH valence band at k, = 0, with 
and without stress. In comparison to the dispersion of the HH band in relaxed silicon 
a substantial modification of the dispersion of the in-plane is observed. The heavy- 
mass contribution from the wings along the [110] direction clearly seen in Fig. 8.4 is 
substantially reduced in silicon compressively stressed along [110] [17]. The effec- 
tive mass decrease along the [110] stress direction is also visible in Fig. 8.2. Stress 
also modifies the dispersion of the LH and SO bands. Figure 8.5 displays stress- 
induced modification of the LH band dispersion. Stress-induced band splitting and 
warping is the main reason of the hole mobility enhancement (or degradation) in 
strained silicon. 
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Fig. 8.3 Stress induced splitting of the heavy hole, light hole, and split-off bands in silicon at the 
/"-point, according to (8.12), (8.17) and (8.18). Compressive stress is applied along [110] direction 



8.2 Inclusion of Strain into Perturbative Band Structure 
Calculations 

Until now the deformation potentials describing the perturbation of the Hamiltonian 
in strained lattices were the constants introduced phenomenologically. Here, follow- 
ing [3], we outline the procedure how the deformation potentials can in principle be 
computed. 

The Hamiltonian describing the electron spectrum in a crystal subject to homo- 
geneous strain is: 



H(€) = 



2mo 



+ Vs{r) + 



AmqC^ 



(VKs[pa]), 



(8.19) 



where Ks(r) is the potential in the strained crystal. In the last term describing 
the spin-orbit interaction a = {Gx , Oy , CTj;) is the vector assembled from the Pauli 
matrices. The Hamiltonian (8.19) should be contrasted with the Hamiltonian of the 
relaxed crystal 



Hc,= 



2m I 



+ K(r) + 



h 



Airi^c^ 



(VK[pa]), 



(8.20) 



with Vs replaced by V . 
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Fig. 8.4 Energy dispersion of the heavy hole band in silicon compressively strained along [110] 
direction by applying 1 GPa stress. The dashed contour lines are for the heavy hole band dispersion 
of relaxed silicon. Contour lines are every 5 meV. 



Let us assume that strain is small. In this case only the terms linear in strain can be 
kept in the Hamiltonian of the strained crystals. However, we can not simply develop 
the difference between the crystal potentials Vs — V into the series in strain because, 
due to mismatch in the lattice constants, the difference between the potential is not 
small. To demonstrate it, let us put the first atom of strained and relaxed Bravais 
lattices at the origin. Then the displacement in the strained lattice of a remote lattice 
point Tfn is proportional to tTm and can become comparable to the lattice constant 
or even larger. It then follows that the difference Vs (Tm) — V{rm) is of the order of 
the crystal potential V(rm) irrespective of the strain value. 

In order to develop a perturbative expansion of the Hamiltonian (8.19) of a 
strained lattice, a coordinate transformation has to be first performed. This is done 
by choosing a new coordinate system in strained crystal in such a way that the posi- 
tions of the Bravais lattice points r^ coincide with the corresponding points in the 
relaxed crystal. The transformation can be written as [3]: 



r' = (l-he)-ir»i (l-e)r. 



(8.21) 
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Fig. 8.5 Energy dispersion of the light hole band in silicon compressively strained along [110] 
direction by applying 1 GPa stress. The dashed contour lines are for the heavy hole band dispersion 
of relaxed silicon. Contour lines are every 10 meV. 



or 

r = (1 -F e)r', (8.22) 

where the j component of (er')y = ^^ ^jt^'j- Taking into account the coordinate 
transformation one can derive that the momentum operator p = —itiVr transforms 
as 

p = (l-e)p', (8.23) 

where p' = —itiVr'. The transformation (8.23) takes the operator p-^ in the kinetic 
energy part of the Hamiltonian in (8.19) into 



P =P -^l^PiSijPj 



(8.24) 



and the potential energy Ks(r) into Vs((l + e)r'): 

Vs{r)^Vsai+s)r'). 



(8.25) 
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Following [3], let us change the notation r' — ^ r. The potential ^^((l + e)r) is now 
having the same period as V{r), and their difference may be expanded in a series in 
terms of strain s as: 

Vs ((1 + £)r) - V{r) = ^ Vijir)sji, (8.26) 



where 

^>W = -^ — r 1™ ' (^-27) 

Z — Oij B^O Sij 

The factor one-half for i ^ j is needed because in this case there are two iden- 
tical terms VJ/e,, and V/,ey entering (8.26). The transformation (8.21) brings the 
Hamiltonian (8.19) into 

H{e) = Ho + H^ + H^so, (8.28) 

where 



and 



^-^ = T~^ E i^'J^J ^^^"Ph - ^iSuVs[ap]j - l(ryV,]iEypj) (8.30) 



Under the transformation (8.21) a Bloch function i/f„k' = Mnkexp(;kr) of the 
Hamiltonian of the strained system (8.19) which corresponds to the eigenenergy 
E„ (k') at the wave vector k' becomes 

V^k = «;k'((l + £)r')e'"'(i+^)--' = «U(1 + e)r')e''-, (8.31) 

where k = (1 -|- £)k', or 

j 

The Bloch function (8.31) has the same periodicity as the Bloch functions in the 
relaxed lattice with the same wave vector k and can be expanded in terms of the 
wave functions ^„k (6.3): 

V^k = E^™'^"'kW- (8.32) 

n' 

Let us now calculate the change in energy due to strain 

5£(k) = E{e, (1 - e)k)) - £o(k), (8.33) 
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where £o(k) is the energy of the relaxed crystal and E{s, (1 — e)k) is the energy of 
the strained crystal at the point k' = (1 — e)k to which k is displaced under stress. We 
substitute the series expansion (8.32) for the wave function ij/^, into the Hamiltonian 
(8.28) and multiply by the conjugate function ^*,^ (r). After the integration over r 
of the resulting expression one obtains the following system of equations: 

J2 ((£«'(ko) - E)Sn'n + Hi) C„n' = 0, (8.34) 



where the perturbation Hamiltonian H' is [3] 

H = Hk + H^ + H^so + ^?k- (8.35) 

Here 

Hk=- + , (8.36) 

2mo mo 

7t=p+-^^[aVV]. 
Here H^ and H-^so ^re defined by (8.29) and (8.30), respectively, and 

Hs^. = -2^^ —■ (8.37) 

mo 

In accordance with general degenerate perturbation theory described in Sect. 6, 
by treating the Hamiltonian H' from (8.35) as a perturbation the modification of 
energy E — £m(ko) due to strain in the vicinity of the point ko is found from the 
solution of the following equation: 

det [<„,, -(E-E^ (ko))] = 0, (8.38) 

where the index m' runs through the all bands degenerate with the band £m(ko) at 
the point (ko) in the relaxed crystal, and the matrix elements in the non-relativistic 
approximation (c — ^ oo) are: 

„, _ ^^^% ^ ^ kn I ^^ Y^ (kPm'.)(kp,m) .„.Q. 

u ^ 

(8.40) 

where (pi)mm' is the matrix element of the i-th projection of the operator p, and 
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By introducing 

j^ij ^ f iPiPjUm' ij \ 

mm' I ^ mm' J ^ ' 

the Hamiltonian H' , (5.18) can be written in the form 

E,mm' ^ ' 



which is exactly the form (8.5). Therefore, (8.42) is the microscopic expression 
which in principle allows to calculate the values of the deformation potentials. In 
order to do so we need to know the Bloch functions at the point ko of the relaxed 
crystal as well as the components Vij defined by (8.27). 

It is difficult, however, to determine the explicit form of the components Fy 
because it requires the exact solution of the self-consistent problem in the strained 
crystal. It is therefore mandatory to have a numerical method that allows to obtain 
the deformation potential. Generalization of the numerical methods to compute the 
band structure described in Chap. 5 to include strain is necessary. Such a procedure 
for tight-binding sp^d^s* model was recently reported [ 1 0] . A generalization of the 
empirical pseudopotential method to include strain is outlined in the next section. 



8.3 Empirical Pseudopotential Method with Strain 

The empirical pseudopotential method can be easily adapted to incorporate strain 
effects. It has been already used to investigate the band structure of biaxially strained 
Sii_j;Gejc grown epitaxially on Sii-^jGe^, for various surface orientations [4, 11,16]. 
In general, to handle arbitrary strain conditions, the following modifications in the 
band structure calculation have to be incorporated [14]. First, the lattice vectors a,' 
of the strained crystal must be calculated by deforming the vectors a, of the relaxed 
crystal according to (7.1). The change of the normalizing volume of the strained unit 
cell Qq is determined by (7.2). 

Next, from the strained lattice basis vectors, the strained reciprocal lattice vectors 
b,' are obtained. Strain-induced symmetry reduction gives rise to a change in shape 
and volume of the irreducible wedge of the first Brillouin zone [15]. 

The basis vectors of the reciprocal lattice are used to calculate the reciprocal 
lattice vectors G' of the strained lattice. Taking into account the periodicity of the 
problem, the following expansion of the pseudo wave function (^t in the plane wave 
basis is employed. 

The form factors of the local pseudopotential are needed at the strained reciprocal 
lattice vectors. For this purpose an interpolation of the pseudopotential is required. 
Different methods have been proposed [6, 1 1 , 1 4] . One of them is a cubic spline inter- 
polation through the pseudopotential form factors, Vq, V3, Kg, V\i, and V-ik^ [14]. 
Following [11], Vq is set to —lE^/li, and Vj,kf = 0, where kp denotes the Fermi 
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wave vector of the free electron gas. Next, the additional displacement of the two 
fee sublattices in the diamond structure modeled by an internal strain parameter 
(displacement factor) ^ must be incorporated into the calculation. 

We use the empirical pseudopotential method to demonstrate the non-trivial 
modifications of the conduction band structure under shear strain in the next section. 
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Chapter 9 

Strain Effects on the Conduction 

Band of Silicon 



9.1 Limitation of the Effective Mass Approximation 
for the Conduction Band of Silicon 

The conduction band in silicon consists of six equivalent valleys with their energy 
minima located close to the corresponding X-points of the first Brillouin zone. 
Within the usually used parabolic approximation (5.1) each valley is characterized 
by two transversal and one longitudinal effective mass [1]. At higher energy a non- 
parabolic isotropic correction must be included to reproduce the density of states 
correctly [9]. 

The change in the conduction band of silicon under biaxial stress is well under- 
stood [7, 12]. Biaxial stress causes splitting between the six equivalent valleys. The 
splitting prompts re-populations between the valleys and also reduces inter-valley 
scattering. These effects lead to a substantial bulk mobility enhancement due to a 
biaxial tensile stress. Mobility in biaxially stressed silicon both in the bulk [7] and 
in the electron inversion layers [6, 17] was carefully investigated. The application of 
local stress techniques results in creating uniaxially stressed silicon in the channel. 
Since the usual channel orientation on a (001) wafer is along [110] direction, the 
local stress is typically aligned with the [110] axis. Although already being used 
in mass production, the stress along [110] direction has received surprisingly little 
attention within the research community. Only recently a systematic experimental 
study of the mobility modification due to [110] stress was performed [19]. It was 
shown that the electron mobility data on (001) substrate under [110] uniaxial stress 
is consistent with the conductivity mass depending on the stress value, in contrast 
to biaxially stressed silicon, where the conductivity mass was shown to be virtually 
independent of the stress value [7]. This effective mass dependence cannot be recov- 
ered within the effective mass approximation, and a generalization of the conduction 
band description in uniaxially stressed silicon is needed. Another shortcoming of 
the effective mass approximation for the conduction band becomes apparent in 
structures with thin silicon bodies. Confining carriers within thin films reduces the 
channel dimension in transversal direction, which further improves gate channel 
control. The quantization energy in ultra-thin silicon films may reach hundreds of 
meV. The parabolic band approximation usually employed for subband structure 
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calculations of confined electrons in silicon inversion layers becomes insufficient in 
ultra-thin films. A recent study of subband energies and transport in (001) and (1 10) 
oriented thin films reveals that even a non-parabolic isotropic dispersion relation is 
not sufficient to describe experimental data, and a direction-dependent anisotropic 
non-parabolicity must be introduced [18]. 

In order to overcome these difficulties, the effective mass approximation usu- 
ally applied for the conduction band must be generalized. An approach based on 
the full band structure computed with the empirical pseudo-potential method [5] 
is promising and has been recently generalized to include strain and spin-orbit 
interaction [14]. Although it uses a realistic band structure, the method is computa- 
tionally demanding and needs to be improved to include the self-consistent solution 
of Poisson equation. 

Another approach is based on the k-p theory. Recently, a 30-band k-p method [11] 
was employed to investigate subbands in thin films. The method gives an accuracy 
comparable with the 6-band k-p method for the valence band. Since the method is 
developed around the /^-symmetry point, it requires all 30 bands to obtain results 
for the conduction band minima located close to the edge of the first Brillouin zone. 
The two-band k-p model [3, 8, 15, 20] is developed in the vicinity of the X-point 
and thus provides a natural framework to compute the subband structure, in particu- 
lar the dependences of the electron effective masses on shear strain and thickness, in 
thin films. In the case of a square potential well with infinite walls, which is a good 
approximation for the confining potential in ultra-thin films, the subband structure 
can be obtained analytically [16]. This allows for an analysis of subband energies, 
effective masses, non-parabolicity, and the low-field mobility on film thickness for 
arbitrary stress conditions. The peculiarity of [1 10] uniaxial stress is that it produces 
an off-diagonal element of the strain tensor and results in a shear distortion of the 
crystal. Under shear deformation a significant change in the band structure appears. 
Namely, due to non-zero values of the shear deformation potential D , the degener- 
acy between the two lowest conduction bands at the X -points of the Brillouin zone 
along the [001] axis is lifted [3]. Thus, for non-zero values of an additional energy 
splitting between the two conduction bands appears at the X-point. Due to this 
splitting, the effective masses in the valleys along [001] direction are substantially 
modified and become functions of the shear stress value. It is worth noticing that 
uniaxial stress along [001] direction, which produces a biaxially stressed (001) sub- 
strate, does not contain the shear strain component. Therefore, the influence of [1 10] 
uniaxial stress on the band structure and transport must be carefully investigated. In 
the following we briefly review the main ideas behind the two-band k-p model for 
a valley in the conduction band of silicon. Then we shortly analyze the unprimed 
subband structure in (001) ultra-thin films, obtaining analytical expressions for the 
effective masses and non-parabolicity parameter. With these parameters the non- 
parabolic subband approximations for the subband dispersions are constructed. The 
non-parabolic subband dispersions can be embedded into a subband Monte Carlo 
code in order to compute the low-field mobility. 
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9.2 The Two-Band kp Model 

The energy dispersion of the conduction band in relaxed siUcon computed with 
help of several methods in [100] and [110] directions is shown in Fig. 6.2. The 
method based on non-local empirical pseudo-potentials from [12, 20] is the most 
accurate one as compared to DFT band structure results obtained with VASP [21], 
The sp^d^s* tight-binding model with parameters from [4] does not reproduce the 
anisotropy of the conduction band correctly. In addition, an accurate calibration of 
the parameters of the sp^d^s* model to describe the modification of the conduc- 
tion band in strained silicon was performed only recently [10]. The k-p theory is a 
well established method to describe the band structure analytically. The k-p method 
reproduces the band structure accurately at energies below 0.5 eV, which is sufficient 
to describe the subband structure and transport properties of advanced MOSFETs. 

In the diamond crystal structure, the lowest two conduction bands A \ and A2' 
are degenerate at the ^ -point due to a special symmetry of the diamond structure: 
the existence of three glide reflection planes, given by x = ao/8, y = flo/S, and 
z = (2o/8 [22]. The plane z = ao/8 is called a glide plane, when it leaves the crystal 
structure invariant under a translation by ^(1,1,0) followed by a reflection with 
respect to this plane. If now the strain tensor contains a shear component Sxy , which 
is a result from stressing the crystal along the [110] direction, the strained lattice 
belongs to an orthorhombic crystal system. The glide reflection plane z = ao/S is no 
longer a symmetry element, thus the degeneracy of the two lowest conduction bands 
Z\i and A2' at the ^-point can be lifted [3,8]. However, in the case silicon biaxially 
strained in (001) plane the glide reflection symmetry is preserved. Therefore, the 
crucial difference between silicon grown epitaxially on (001) Sii_j,Ge3, substrate or 
silicon uniaxially strained/stressed along an [100] from silicon stressed along [110] 
direction is the presence of the non-zero shear strain component Sxy , which removes 
the glide reflection plane and lifts the degeneracy between the two conduction bands 
Ai and A2' of the [001] valleys at the X-pont. 

In order to obtain the analytical expressions for energy dispersion within the 
k-p theory let us consider the valley pair along the [001] direction. Other pairs of 
valleys can be analyzed in a similar fashion. In relaxed silicon the two conduction 
bands Ai {i = 1) and A2' (i = 2) become degenerate exactly at the ^-points. Since 
the minimum of the conduction band is only ko = 0.15— away from the ^-point, 
the dispersion around the minimum is well described by the degenerate perturbation 
theory built at the X -point, which includes only these two bands. Diagonal elements 
of the Hamiltonian Ha of the strained crystal at the X -point are: 

fi h^k^ fi^k^ fi^k^ 

Hliy^ = i-iy-'^k,p + y^ + y^ + -f^ + ^Ec, (9.1) 

Wo 2/M; Lnit 2mt 

where i = 1 , 2, otq is the free electron mass, nit is the transversal and m/ the longi- 
tudinal effective mass. The only difference with respect to the two-band Hamiltonian 
(6.22) already discussed in Chap. 6 is the presence of the strain-induced shift of both 
valleys [2]: 
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SEc = Dd (Sxx + Syy + £z^) + D„s^, (9.2) 

with Z)j denoting the dilation and £)„ the uniaxial deformation potentials for the 
conduction band. Here we took into account that the matrix elements (pz)ii are 
different only in sign, which is positive for the lower band: 

P = (pJn = -{.Pz)22- 

The values ofk^ are counted from the X point. The coupling between the two bands 
is described by the off-diagonal terms including shear strain [8]: 

Hijik) = Hi - £)£,,, (9.3) 

where D = 14 eV denotes the shear deformation potential, and 



<(k) 



Tl KxK.y 

M 



The parameter M can be evaluated from the k-p perturbation theory: We have com- 
puted M by the empirical pseudopotential method at the point k. = —ko, where the 
numerical value is close to M ^ rrit/il — nit /mo) reported in [8]. With degenerate 
perturbation theory we obtain the following dispersion relations of the [001] valleys 
including the shear strain component for the two lowest conduction bands: 

h^kf h^{kl+kl) 

£(k) = ^ + — ^ '- + SEc 

2mi Init 

I ^ (9-4) 

I y 1 - r 1 ,17-. ^ l^xi^y 



\M<^' 



M 



Below we briefly analyze the behavior of the dispersion relations (9.5) and compare 
it to the results of the empirical pseudopotential simulations. 



9.2.1 Valley Shift Due to Shear Strain 

It follows from (9.4) that shear strain Sxy lifts the degeneracy between the conduc- 
tion bands of the two [001] valleys at the X-point by opening a gap 

SEx[ooi]£xy =2D\sxy\ (9.5) 

The splitting is linear in shear strain. This linear splitting transforms into a nonlinear 
shift of the valley minimum. This is a consequence of two facts: 

• In relaxed silicon the [001] valley minimum is located at the distance ko = 
0.15 {27t/ao) away from the corresponding X-point; 

• The position of the valley minimum depends on the shear strain component. 
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In order to find the valley shift due to shear strain we first find the position of the 
minimum from (9.4). Introducing dimensionless strain as 

^=^^, (9.6) 

the position of the minimum is determined as [15, 20]: 

A:mi„ = -^oVl-'?2, \r]\<l. (9.7) 

It follows from (9.7) that the minimum moves closer to the X-point with the shear 
strain increased. Because it depends on e^,„ the minimum approaches the X-point 
for tensile (Sxy > 0) as well as for compressive (e„ < 0) stress along [110] direction. 
At high strain values rj > I the minimum rests exactly at the X — point. To estimate 
strain value corresponding to |)]| = 1 we use A = 0.53 eV, D = 14 eV, so £„ = 
0.019, or 1.9%. This value of strain is achieved at 6GPa uniaxial stress applied 
along [110] direction. Such a value is far above the level of 2 GPa stress achieved so 
far by the semiconductor industry, however, in nanowires strain of up to 4% can be 
achieved [13]. 

The energy shift of the minimum due to shear strain can now be evaluated: 

^£shear = (9.8) 

l-{2\rj\-l)A/4, 1^1 >1 

Figure 9.1 shows the dependence of the absolute value of the valley shift as the 
function of shear strain r]. It is observed that for |?;| < 1 the energy shift is pro- 
portional to a square of shear strain (9.8), regardless the fact that we have used the 
linear deformation potential theory in (9.3). This nonlinearity is the consequence of 
the fact that the valley minimum is located not at the X-point where the linear defor- 
mation potential theory was written. As soon as the minimum reaches the X-point, 
the linear dependence of the valley shift is restored. 

Figure 9.2 demonstrates a perfect agreement between k, energy dependence pre- 
dicted by (9.4) and the results of numerical pseudopotential calculations. The gap 
linear in shear strain opening at the X-point is clearly seen for £ 7^ in Fig. 9.2. 
Because the dependence of the additional valley shift is proportional to the square 
of shear strain, the question naturally arises on how relevant this additional shift is 
and is it necessary to include higher terms beyond the deformation potential approx- 
imation to describe the shift correctly. Figure 9.3 compares the valley shift due to 
shear strain Sxy and the diagonal component e--. computed from (9.8) and (9.2), cor- 
respondingly (solid line), with the shift obtained from the empirical pseudopotential 
method (dotted line). The agreement persists up to a high level of strain. The differ- 
ence between the two curves remains small as compared to the valley shift due to 
the two contributions (9.8) and (9.2) indicating that the corrections due to the higher 
order deformation potentials are small. 
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Fig. 9.1 [001] valley energy shift as function of the dimensionless off-diagonal component of the 
strain tensor, as predicted by (9.8) and by EPM calculations. Inset: conduction band profile along 
the [001] direction for different stress values 
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Fig. 9.2 Comparison of the k- dispersions obtained from the analytical model (9.4) (lines) and the 
EPM calculations (symbols), for several values of shear strain. In the analytical dispersion (9.4) k, 
is counted from the X-point (k, = lir/a — K-), £) = 14eV and mi = 0.91;?7o 
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Fig. 9.3 Energy shifts of the [001] valleys due to £,,, and E-- computed from (9.8) and (9.2), 
correspondingly, are compared to the shift computed with the empirical pseudopotential method. 
This figure demonstrates that the contribution due to higher order deformation potentials may be 
ignored up to high strain level. This check is necessary to prove the relevance of the energy shift 
(9.8) quadratic in shear strain 



9.2.2 Stress-Dependent Transversal Effective Masses 

A shear strain component Sxy modifies the effective masses of the [001] valleys. 
Evaluating the second derivatives of (9.4) at the band minimum (9.8), we obtain 
two different branches for the effective mass across (niti) and along (mi2) the stress 
du-ection [110] [15,20]: 



mn(ri) 



mt 
nitiiri) 



•sgnW-j 




\r]\>\ 
\ri\<\ 

\r)\ > 1 



(9.9) 



(9.10) 



Here, sgn{ri) denotes the sign function. In Fig. 9.4 the analytical expressions for 
the transversal masses (9.9) and (9.10) are compared with the masses obtained from 
EPM calculations. To improve the agreement at high values of stress r] ^ I, the 
deformation potential D is set to be slightly stress dependent in the form D(r]) = 
D + Pr]^,withp =0.7eV. 
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Fig. 9.4 Dependence of the [001] valley transversal effective mass on the dimensionless [110] 
uniaxial strain as predicted by (9.9) and (9.10) (lines) and EPM calculations (symbols). Shear 
stress generates strong anisotropy in the transversal mass 



Interestingly, the effective mass depends "linearly" on the shear strain compo- 
nent, contrary to an assessment in [2] that within the linear deformation potential 
theory there should not be any warping of the conduction band existing. A compar- 
ison of the effective mass dependences on strain extracted from the mobility mea- 
surement in strained samples [19] with (9.9) and (9.10) shown in Fig. 9.5 displays 
an excellent agreement. 



9.2.3 Dependence on Strain of the Longitudinal Effective Mass 



The dependence of mi (rj) on shear strain r] can be found analogously [15,20]: 



mi(rj)/mi = 



(1 - rj') 



2\-l 



h|<l 



(9.11) 



(l-l/h|)~i, h|>l 



The transversal mass becomes infinite, when the valley minimum touches the 
^-point (Fig. 9.6). At this moment the terms higher in order than kf must be 
considered in the dispersion. 



9.2 The Two-Band k-p Model 
0.26 



113 



0.24 



0.22 



0.18 



0.16 



0.14 



_L 



_L 



_L 




_L 



_L 



0.05 0.1 0.15 0.2 0.25 

Fig. 9.5 Comparison of (9.9) and (9.10) to the transversal masses extracted from mobility 
measurements [19] 



The cyclotron mass from the experiment [8] is shown in Fig. 9.7 by symbols. 
In this experiment, a uniaxial tensile stress was applied along the [110] direction. 
This stress produces the shear strain component Sxy, which affects the masses of the 
[001] valleys. The direction of applied magnetic field within the plane (001) can be 
arbitrarily rotated. The angle <p determines the direction of the magnetic field with 
respect to the [1 10] axis. 

The cyclotron mass mc is defined by the expression 



mc(<p) = y/mimi{(p), 



(9.12) 



where 



mt((p) = 



mnm,2 



(9.13) 



rriti cos^{(p) + mt2 sin^((/>) 

Figure 9.7 indicates that, apart from the mi increase described by (9. 11), there might 
be an additional contribution into the longitudinal effective mass which results in the 
necessity to shift the theoretical curve upwards in order to match experimental data. 
However, the dependence of the cyclotron mass on angle (p is reproduced perfectly 
supporting the change of the transversal masses with shear strain described by (9.9) 
and (9.10). 
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Fig. 9.6 Stress dependence of longitudinal effective mass in the [001] valleys due to [1 10] stress. 
Effective mass diverges at ?; = 1 suggesting that full-band theory must be used for such stress 
values 
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Fig. 9.7 Cyclotron mass as function of field direction obtained with (9.12). While the angular 
dependence described by (9.12) is accurate, an additional increase to mi is introduced to reproduce 
the data from [8] 
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9.2.4 Stress and Non-Parabolicity 

Shear strain affects the value of the non-parabolicity parameter a as well. In order 
to find the dependence, we rewrite the dispersion relation (9.4) in the vicinity of the 
valley minimum ^min, which in the case of kx,ky -^ is equal to 






I>£rv - 



fi'^k^ky \ 2 



l_4l _^Z M_| , (9.14) 



and, therefore, depends on kxky. Expanding (9.4) around the minimum (9.14) for 
small Pz = k, — kmin, one obtains: 

fi^p^ h^{kl + k^) A 

E(k) = ^-^ + -—^ ^ 

Imilri) 2m, 4 

_ 2 (9.15) 



4( 



Dsxv - 



"^ M 



Taking into account the shifts of the valley minima (9.14) and the effective mass 
changes (9.9), (9.10), we rewrite (9.15) as 



E(\) = —-^ + ^ + 



2mi(r]) Imnir]) 2mt\{ri) 

h' {kl - kl) 

2M^A 

where kx = {kx + ky)/ y/2 and ky = {kx — ky)l \f2 are the momentum projec- 
tions in the rotated coordinate system, where the new x axis is along the [110] stress 
direction. The non-parabolic term written in the original xy coordinate system is 
similar to the corresponding term in [9]. However, the parabolic term contains the 
effective masses ma (if), mtiiri), which are modified due to [1 10] stress. Introducing 
new variables x = Pxl \lmt2ij]), y = Pyl \lmt\(r\), we separate the renormal- 
ization due to stress of the parabolic part of the density-of-states (DOS) from the 
non-parabolic contribution. One rewrites (9.16) at k^ = —ko(r]) as 



E{x,y)= ^- ^ ' 



■{mt2(r))x^ -mtiir])y^) 



.2,2 h' 



(9.17) 



2M^A' 



The last term gives the correction to the density of states of the parabolic bands. 
Assuming the last term to be small, we compute the correction to the DOS 
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identifying it with the isotropic non-parabolicity parameter a{r]). In order to find 
the dependence ofa{r]) on strain we write the following expression for the DOS: 



D(E) 



/dkxdky 
(2;r)2 



S{E - £(k)) = 



^/mn(vi)mt2{ri) 
2(27r)2 



/ 



d^HE,cP) 



dE 



where 



^' 



mn m,2 



K-y 



(9.18) 



(9.19) 



Taking into account the dependences (9.9) and (9.10) of the masses mt\(r]), mt2{r]) 
on the stress r], we arrive at an expression for the strain dependent non-parabolicity 
parameter q;(?7): 

l+2(?jm,/M)2 



a{r]) = (Xo- 



(9.20) 



(l-ir,mt/Myr 

Expression (9.20) is plotted in Fig. 9.8. The non-parabolicity parameter depends 
on the square of the stress value and therefore does not depend on whether stress 
is tensile or compressive. This is expected, since tensile strain in [110] direction 
produces compression along [110], thus a can not depend on the sign of stress. 
Due to the square dependence, the relative increase of a{ri) is more pronounced at 
relatively large stress values. 
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Fig. 9.8 Non-parabolicity parameter in the [001] valleys increases as function of [110] tensile 
stress, according to (9.20). Inset: Relation between dimensionless strain jj and stress in GPa 
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Fig. 9.9 Numerical DOS in the [001] valley normalized to the DOS analytical expressions 
obtained without (dotted lines) and with stress dependent non-parabolicity parameter (solid lines), 
for relaxed Si and Si under [110] uniaxial stress 



Figure 9.9 shows the density-of-states of the [001] valley as a function of energy 
relative to the minimum, obtained numerically from empirical pseudo-potential 
calculations, for the unstressed case and stress equal to 3 GPa. The value of the 
non-parabolicity parameter at 3 GPa is 70% higher than the relaxed value ao, 
according to (9.20). The DOS is normalized to the analytical DOS correspond- 
ing to the parabolic dispersion (dotted lines), with strain dependent transversal 
effective masses (9.9), (9.10) and strain dependent longitudinal mass mi(r]). With- 
out the non-parabolicity parameter included, considerable deviations in the ratio 
of the numerical DOS to its analytical value is observed at higher energies. The 
ratio becomes much closer to unity, if the non-parabolicity correction due to stress 
dependent a{r}) is taken into account in the analytical DOS (Fig. 9.9, solid lines). 
However, a deviation of the numerical DOS from the analytical model with the stress 
dependent non-parabolicity parameter is observed for energies larger than ks T at 
high stress value. This happens due to the fact that the energy difference between 
the value at the minimum and the value at the X -point decreases with stress (see 
Fig. 9.2). A pronounced peak in the numerical DOS appears, which corresponds 
to the flat dispersion close to the X-point. The energy value of the peak is shown 
in Fig. 9. 10 for different stress values. It is in agreement with theoretical predictions 
based on (9.4). For stress values larger than 3 GPa the energy difference from the 
minimum to the value at the Z-point becomes smaller than 2x ksT, as seen from 
Fig. 9.10. In this case the non-parabolic approximation with the strain-dependent 
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Fig. 9.10 Energy at the X-point relative to the energy value at the minimum. The solid line is a 
theoretical prediction based on (9.4) 



effective masses and the non-parabolicity parameter becomes insufficient for the 
description of even low-field mobility, and a full-band description is required [20] . 



9.2.5 Comparison of the Two-Band k-p Model with Strain 
to the Empirical Pseudo-Potential Calculations 



We now demonstrate that the two-band k-p model [3, 8, 15, 20] not only describes 
accurately the dependences of the valley shifts and the effective masses on the shear 
strain component. By comparing the model to results from the empirical pseudopo- 
tential method (EPM) we show that the two-band k-p model predicts the correct 
energy dispersion in a wider range of momenta in the Brillouin zone capturing 
non-parabolicity effects due to the interaction between the two lowest conduction 
bands. 

Figure 9.11 demonstrates a good agreement between the analytical band structure 
described by (9.4) and the numerical one obtained by the empirical pseudopotential 
method in the case, when tensile stress of 150MPa in [1 10] is combined with com- 
pressive stress in [110] direction. The chosen strain configuration allows to generate 
only the shear strain component. Comparison with the analytical band structure 
within the parabolic effective mass approximation with strain-dependent masses 
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Fig. 9.11 Comparison between the analytical model (9.4) (dashed lines) and the EPM calculations 
(solid lines). The contour lines are spaced at SOmeV. Tensile stress in [1 10] and compressive stress 
in [1 10] direction of ISOMPa in each direction is applied 



(9.9), (9.10) is sliown in Fig. 9.12. The agreement between all the three methods 
is good for energies close to the valley minimum. At energies larger than 40 meV 
the parabolic approximation becomes less accurate, while the analytical two-band 
k-p model (9.4) closely follows the numerical dispersion. 

The reason for such a good agreement between the two-band k-p model (9.4) 
and the numerical band structure is revealed in [3], where it is shown that the 
Hamiltonian obeying symmetry of the Brillouin zone at the X -point must be in 
the form 

H =At + Ba, + Ca^, (9.21) 

where ct, are the Pauli matrices, 1 is the unit matrix, and A,B, and C are the param- 
eters. The two-band k-p Hamiltonian for a [001] valley in silicon defined by (9.1) 
and (9.2) can be written as: 



H 



h^E h^kl ti^kl ^ \ h 

^ H -\ + &Ec + 

2mi 2mt 2mt J m 



( 
-k,pa,, -I- I Dt 



M 



Ox 



(9.22) 
which is exactly the form required by symmetry considerations. Since the valley 
minimum is not far from the ^-symmetry point and moves closer when stress is 
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Fig. 9.12 Comparison between the analytical model (9.4) (dashed lines), the EPM calculations 
{solid lines), and the parabolic approximation with strain dependent effective masses (9.9), (9.10) 
{dotted lines). The contour lines are spaced at 20meV. Tensile stress in [110] and compressive 
stress in [1 10] direction of 150 MPa in each direction is applied 



applied, the Hamiltonian (9.22) does excellent work in approximating the conduc- 
tion band structure. Because the k-p model is valid in a larger portion of the Brillouin 
zone as compared to the parabolic approximation, it can be used to investigate the 
dispersion of the quantized subbands in MOSFET surface layers and in thin silicon 
films. 
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Chapter 10 

Electron Subbands in Silicon in the Effective 

Mass Approximation 



10.1 Arbitrary Substrate Orientation 

In MOSFETs the charge is transferred in a surface inversion layer close to the 
interface between silicon and an oxide. The surface layer is created by applying 
an appropriate voltage to the gate electrode. In this case a potential well at the 
oxide interface in silicon is formed. The charge carriers are confined in this potential 
well. Charge carriers are free to move along the interface r = (x, y) and the wave 
function along the interface can be taken in a plane wave form. In the orthogonal 
direction, however, the carrier motion is confined, so the total wave function is in 
the form: 

'/'(r,z) = exp(/kr)iA„k(z). (10.1) 

The width of the well is comparable to the de-Broglie wave length of charge car- 
riers in the quantization direction. The subband energies E are found from the 
Schrodinger equation for the envelope wavefunctions ij/nk (z) : 

[T - eU{z)] exp(/kr)Vf„k(z) = £„kexp(/kr)^„k(z), (10.2) 

where U{z) is the potential energy and T is the kinetic energy operator. In case it is 
presented by the corresponding k-p expressions for the valence or for the conduction 
band the potential energy term U(z) is substituted with U{z)t where 1 is the unit 
matrix. The wave function is then a vector with six component in the former or a 
spinor with two components in the latter case. 

The conduction band in silicon consists of the three pairs of equivalent valleys 
oriented along the crystallographic axes x, y,z- To calculate the subband structure 
for any substrate orientation, one has to transform the Hamiltonian by means a 
unitary transformation from the crystallographic system Xc, yc, Zc to the interface 
coordinate system in which the potential energy is in the form U(z)- The unitary 
transformation matrix from the crystallographic coordinate system to the interface 
coordinate system is given by 
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''cos (p COS 9 — sin <^ cos (p sin f 
U = ( sin <^ cos ^ cos,(j) sin sin |, (10.3) 

— sin ^ cos ( 

and involves a rotation of about the Zc axis followed by a subsequent rotation of 
9 about the new y axis. The direction of the quantization axis is given by 

Cj = (cos (^ sin ^, sin <^ sin ^, cos ^). (10.4) 

If a single-band effective mass approximation is applied [1,2], the kinetic energy 
is in the form (9.1). The momentum operator and the reciprocal effective mass tensor 
V = m~^ in the interface coordinate system can be written as: 

Pj=J2^Jkpl^\ (10.5) 

k 

vij = J2 UaUjkvi'j^. (10.6) 



Here, Ujk are the elements of a unitary matrix, y^^ = l/mk,andmic = nit, mi are 

the transversal or longitudinal effective masses in silicon. The kinetic energy is then 

in the form: 

1 v^ 

T = -2_VijPiPj, i,j=x,y,z. (10.7) 

ij 

where pj = —ih{d/dxj) denotes the momentum operator, and v,y is the reciprocal 
effective mass tensor. 

The function V'"nk(z) satisfies the equation: 



fi^ d^^ , d^ 

— V33-r^ + ih (viskx + Vi-iky) — 
2 dz^ dz 



1^33:777 + ifl^iViskx + V23ky)-^ + (E- UizMnkiz) = 0, (10.8) 



where 

E = E- —(vuk^ + Iv^jkikj + V22kl). (10.9) 

Following Stern and Howard [3], the first derivative in (10.8) can be eliminated 
using the substitution 



l( (Vi3^1 + V23^2) I 

V V33 / 



1A(Z) = ^(Z) exp (Vi3-tl + V23-^2) . (10.10) 

V V33 

The differential equation for ^ (z) takes the form 

-^^ + ^{Ei-Uizm{z) = Q, using m^ = l/i;33. (10-11) 

dz^ n^ 
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The subband energies Ej and the functions ^, (z) do not depend on the wave vec- 
tor k for the boundary condition V^nklO) = at the sihcon/oxide interface [3]. 
This boundary condition can be applied, when the potential barrier at the interface 
is high enough so that the penetration of the wave function into the oxide can be 
neglected. The eigenfunctions t,{z) satisfy the boundary conditions linij^oo ?; (z) = 
lim,_>o i,i (z) = and are assumed to be normalized: 



/ 



dz^i{z) = l. (10.12) 



^2 



The energy spectrum is given by [3] 

E{ki , k2) = Ei + %-\{vn-—^kl + 2 (vi2 - "^1^^] k,k2 

2 L V V33/ V 1^33 / 

+ (v22-^V|]. (10.13) 

The energy levels £, for a given value of wx generate a set of subband minima 
called subband ladder. The value of the quantization mass depends on the sub- 
strate orientation. Since the conduction band of silicon consists of three pairs of 
valleys, in the effective mass approximation there exist three ladders of subbands 
for an arbitrary substrate orientation. For a particular substrate orientation an addi- 
tional degeneracy between the subband ladders is achieved. According to (10.11) 
the ladder with the largest quantization mass mx has the lowest energy. Following 
a convention, the subband ladders are called unprimed, primed, and double primed 
in compliance with the energy of the lowest subband in a ladder beginning with the 
lowest energy [1]. 

The effective mass approximation eases the computational effort considerably, 
however, as it will be demonstrated in the next section, a more general approach 
based on the k-p Hamiltonian is required to describe the electron subband struc- 
ture in stressed silicon-on-insulator MOSFETs with ultra-thin silicon body, and the 
hole subbands in the valence band. The potential energy U{z) is determined by the 
electrostatic self-consistent potential i>{z) defined by all charges in the inversion 
layer including electrons. Thus, in order to find the subbands in the inversion layer 
the Schrodinger equation (10.2) must be solved together with the Poisson equation 
determining the shape of the electrostatic potential '/'(z): 

V^0 = -An — [Nd(z) + p(z)-n{z)]. (10.14) 

Here, A^^ (z) is the doping profile in the semiconductor, and p (z) and n (z) denote the 
hole and electron concentration, respectively. The boundary conditions for the Pois- 
son equation in a surface layer are: lim-^00 ^(z) = far away from the interface in 
the bulk and 
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d0 



d^ 



(10.15) 



;=o- ■ dz 2=0+ 

at the silicon-oxide interface. In (10.15), (10.14) Kg^ and k^c are the dielectric 
permittivity of the oxide and semiconductor. 
The dispersion (10.13) 



E{ki,k2) = Ei + 



k^M-^k, 



(10.16) 



represents an ellipse with the principal axes rotated with respect to the axes x and y 
of the crystallographic coordinate system. The corresponding effective masses Wn i 
and OT II 2 associated to motion parallel to the interface are obtained by diagonalizing 
the matrix 

M-l =( -n- Vf3/V33 V,2 - V13V23/V33A ^^^^^^ 

Vvi2 - V13V23/V33 V22 - 1^23/1^33 / 

The eigenvalues m~\ and m~\ are calculated by solving the secular equation 

detflVI-i-m-i/') = 0, (10.18) 

where / denotes the two-dimensional unit matrix. 

10.2 Substrate Orientation (001) 

The interface coordinate system coincides with the crystallographic system for (001) 
substrate orientation. In the principal crystallographic system the inverse effective 
mass tensors describing the energy dispersion of the three valley pairs are 



V 



'(1) _ 



mi 


0\ 


^ 

m, 





i^O 


m7/ 



V 



'(2) _ 



/;^ 0\ 

mi 

0^0 

mj 





and y'(^) = 



/"^ 0\ 



^/ 



-!- 

mt 

1 



7/ 
(10.19) 



After the kinetic energy operator is determined, the subband structure is found by 
solving (10.2). Because the potential energy term includes the potential created by 
carriers confined in the surface layer, (10.2) must be solved simultaneously with the 
Poisson equation in a self-consistent manner. Subband structure of (001) surface 
layers in the effective mass approximation is described in details in [1]. 
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The principal crystallographic system does not coincide with an interface system 
and a coordinate transformation is necessary to determine the quantization and 
transport masses for (110) oriented substrate. From the unit vector 



.(110) _ 



1 

71 



(1,1,0)= (cos0 sin^, sincp sin 9, cos 9) 



(10.20) 



defining the substrate orientation, the angles 6 = 90° and (p = 45° of the coordinate 
transformation can be identified. The transformation matrix is written in the form 



-I/V2 1/V2\ 
U= I I/V2 l/v/2 . 
-10 / 



(10.21) 



The reciprocal effective mass tensors for the tiiree valley pairs are found from (10.6) 
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mj 
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(10.22) 



The quantization masses m^^'^' = I/V33' = 2mimt/(mt + mi) for the two 
valley pairs labeled 1 and 2 are equal. The quantization mass of the remaining valley 
is m]_ = nit- Because 2mimt/(mt + mi) > mt, the two pairs of valleys with the 
larger quantization mass m\^ form the lowest fourfold degenerate unprimed sub- 

band ladder. The remaining valleys with m\ = mt belong to the primed twofold 
degenerate subband ladder. For the transport masses the eigenvalue problem (10.18) 
for M~' has to be solved, which for (110) substrate yields 



M-i<'-2> = 











M7' = "!' 







•(3) 







mt+mj J 

Therefore, the transport masses for the unprimed subband ladder are 



,(1,2) 



.(1.2) 



'^' = mt, m\2 = (mi + mt)/2, 



while for the primed ladder 



(3) 

m..\=mi, m 



(3) 

11,2 



mt 



(10.23) 



(10.24) 



(10.25) 
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are obtained. Opposite to the subbands at (001) substrate, where the unprimed 
ladder is twofold degenerate and the primed ladder is fourfold degenerate, at a 
(110) interface the unprimed ladder is fourfold and the primed ladder is twofold 
degenerate. 



10.4 Substrate Orientation (111) 

The substrate orientation vector e„ for a ( 1 1 1 ) substrate is -7= ( 1 , 1 , 1 ) ■ The axes 
of the principal crystallographic systems have to be rotated by the angles 
cos^ = I/V3 (9 ^ 54.74°) and (p = 45°. The inverse effective mass tensors 
for the three valley pairs are 



v(i) = 



V(2)^ 
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(10.26) 



(10.27) 



(10.28) 



Thus, all valleys have the same quantization mass m^ 



(1,2,3) 



3mtmf 



^ j_T^ and therefore 
belong to the same sixfold degenerate subband ladder. The inverse transport masses 
are obtained as the eigenvalues of M~ ^ 



3(m/+mj) -s/3(m/— m/) 



(3) 



\2mi(2mi+mt) 4mim,+2mf 

from where one obtains 



2mf+mt 





(10.29) 



(1,2,3) . (1,2,3) 2ot; + Mt 

/Mii J = rrit, and fn^i 2 = ■ 



(10.30) 



for all three valley pairs. The subband ladder is sixfold degenerate. 
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Table 10.1 Principal effective masses of the silicon subband ladders for three substrate ori- 
entations, mill, and m|i,2 denote the transport masses, while m± stands for the quantization 
mass 



Surface orientation Degeneracy njn 



mia 



m± 



Ladder 



(001) 



(110) 



(111) 



rrit 
mi 



mi 



nit 

m, 

m, + mi 

2 

m, 

2mi + m, 



mi 
mi 

2m,mi 

m, +mi 

m, 

3m, mi 
m, + 2m; 



Unprimed 
Primed 

Unprimed 
Primed 

Unprimed 




t[110] 



[110] 



[112] 



(001) 



(110) 



(111) 



Fig. 10.1 Schematic subband structure at (001) (left panel), (110) (middle panel), and (111) (right 
panel) interfaces. At (001) and (110) surfaces the subbands belong to the twofold and fourfold 
degenerate ladders, while at a (1 1 1) interface there exists one ladder with degeneracy equal six 



In Table 10.1 the principal effective masses of silicon for the three substrate ori- 
entations are summarized [3]. In relaxed silicon the six conduction band valleys 
form a set of two subband ladders for substrate orientation (001) and (1 10), whereas 
for (111) oriented substrate only one subband ladder with sixfold degeneracy is 
formed, as it is schematically shown in Fig. 10.1 
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Chapter 11 

Electron Subbands in Thin Silicon Films 



11.1 Numerical Methods for Subband Structure Calculations 

Strain and hybrid orientation techniques are among the most important concepts 
to increase the performance of modern MOSFETs. The reason for the mobility 
enhancement lies in the band structure modification caused by stress. Multi-gate 
FinFETs and ultra-thin silicon body-based Silicon-On-Insulator (SOI) FETs are 
considered as perfect candidates for the 22 nm technology node and beyond. Mod- 
ification of the subband structure of inversion channels is the reason for improved 
transport characteristics of strained devices. Strong size quantization leads to a for- 
mation of quasi-two-dimensional subbands in carrier systems within thin silicon 
films. The electron subband structure is usually approximated by six equivalent 
minima located close to the X-points in the Brillouin zone. The dispersion of the 
conduction band valley is usually described by a parabolic approximation with the 
transversal masses mt and the longitudinal mass mi. Isotropic non-parabolicity 
takes into account deviations in the density of states at higher energies. In ultra- 
thin body (UTB) FETs, however, the band non-parabolicity affects the subband 
energies substantially, and it was recently indicated that anisotropic, direction- 
dependent non-parabolicity could explain a peculiar mobility behavior at high 
carrier concentrations in a FET with (110) UTB orientation [32]. Therefore, a 
more refined description of the conduction band dispersion beyond the usual single- 
band non-parabolic approximation is needed. Another reason to challenge this 
standard approximation is its inability to address properly the band structure mod- 
ification under stress, where a more general description is needed [33]. Indeed, 
shear strain also modifies substantially both the longitudinal [27, 34] and transver- 
sal [13, 27, 33, 34] effective masses. Any dependence of the effective masses on 
stress is neglected within the single-band description of the conduction band and 
can only be introduced phenomenologically. In order to describe the dependence of 
the effective mass on stress a single-band description is not sufficient, and coupling 
to other bands has to be taken into account. 

Several options are available. The k-p theory is a well established method to 
describe the band structure analytically [16]. Recently, a 30-bands k-p theory was 
introduced [20]. Although universal, it cannot provide an explicit analytical solution 
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for the energy dispersion. An efficient two-band k-p theory [4, 1 3, 27, 34] reproduces 
the band structure at low energies quite well as shown in Chap. 9. 

We use the empirical non-local pseudopotential method (EPM) for numerical 
band structure calculations. The parameters of the EPM are adjusted in order to 
reproduce the measurable quantities of semiconductors, energy gap and effective 
masses, as well as results of the first principle density-functional calculations. The 
method includes spin-orbit coupling. In our calculations of the silicon band struc- 
ture we used the parameters from [21]. As follows from Chap. 5, the EPM results 
are the most accurate, when compared to the first principle density-functional band 
structure calculations obtained with the VASP [35]. 

Accurate modeling of the subband structure is mandatory in modern TCAD tools 
in order to be able to describe MOSFET performance enhancement. The "Linear 
Combination of Bulk Bands" (LCBB) method [8] approximates the wave function 
in a size-quantized system as a superposition of the bulk Bloch functions found from 
a pseudopotential method. The coefficients in the linear combination depend on 
the confinement potential. In its original version the LCBB method did not include 
stress and was not able to describe the valence band due to lack of spin-orbit inter- 
action. With a generalization to include spin-orbit interaction and strain the LCBB 
method can be used to prove that less computationally demanding methods based on 
the k-p Hamiltonian are accurate enough thus providing a viable alternative to elec- 
tron spectrum calculations in confined systems based on a tight-binding approach. 
The two-band k-p Hamiltonian (9.1)-(9.3) describes the bulk structure of the con- 
duction band accurately up to energies of 0.5 eV [28]. It includes a shear strain 
component which is neglected in the parabolic approximation [4, 13,28]. Shear 
strain is responsible for effective mass modification and is therefore an important 
source of the electron mobility enhancement in ultra-thin silicon films [32, 34]. 
We also demonstrate by comparison with LCBB calculations that the six-band k-p 
Hamiltonian accurately describes the hole subband structure. 



11.2 "Linear Combination of Bulk Bands" IMethod 

In the LCBB method the solution 

ifE(r) = J2^n{KkJ\n,k,k,} (11.1) 

n,k- 

of the Schrodinger equation (Hq + V{z)) ifEi^) = E\j/E{r) with the confinement 
potential V{z) is obtained using the complete set of the Bloch functions \n,\i,k,) 

2 

(k = {kx,ky)) of the Hamiltonian Hq = ^ — h C/(r) of bulk silicon [8]: 

f T^ + t/(r) I |n, k, k,) = EfBiK k,) \n , k, k,) . 
\2mo J 
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Therefore, the method is suitable to get the dispersion relations for any confining 
potential in silicon. In this work we model the thin silicon film of the thickness t 
by approximating V(z) with a square well potential although the generalization to 
an arbitrary potential profile is straightforward. The hight of the potential barrier 
corresponds to the potential energy barrier at the semiconductor-oxide interface. 

The advantage of the LCBB method is that it allows to use accurate band struc- 
ture and Bloch functions obtained by a suitable numerical method without any 
additional approximation. The Schrodinger equation written in the basis of (11.1) 
takes the form [8] : 

(11.2) 
= (£„(k)-£fB(k,^,))^„(k,A:,), 

where i?n(k) is the «-the subband energy at k. It is important to stress that in the 
summation of ( 1 1 .2) k^ must vary in the whole periodicity interval of the reciprocal 
lattice vector space from [—2n/aQ,2n/aQ\ along the k, direction [8]. The matrix 
element of the confining potential can be expressed via the overlap integrals of the 
periodic parts of the Bloch functions (M„,k,fc l^n'^k,*:,') as 

{n,\i,k,\V(z)\n',\i,k'}j 

— J2(g,g-)i'^n,k-S,ICz-gz {"n'MMj (11-3) 

where (g, g-) is the reciprocal lattice vector, 

V(k,)= I '^V(z)&xp(ik,z) 

is the Fourier harmonic of the confining potential and 5k,k'+g is the Kronecker's 
delta [8]. The Fourier transform V{q) decays rapidly for q >2 ^/gf + g^ . This 
makes it sufficient to take only a few terms in the summation over the reciprocal 
lattice vectors in (11.3) to obtain the desired accuracy of in the subband energies 
EM(k). 

Important ingredients of the LCBB method are the overlap integrals ( 1 1 .3) of the 
periodic parts of the Bloch functions. Overlap integrals of the Bloch functions from 
the first and second conduction band computed with pseudopotentials from [34] 
are in excellent agreement with previously published results [8] as demonstrated 
in Fig. 11.1. The advantage of the pseudopotential method [34] is that it provides the 
band structure in strained silicon. The method also includes the spin-orbit coupling 
which is important to describe the valence band. 

We first investigate the subband structure of a (001) silicon film. The disper- 
sion relations for the unprimed subbands in a 5.4 nm thick relaxed Si film in [110] 
direction are shown in Fig. 11.2. The subband calculations based on the full-band 
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Fig. 11.1 Comparison of the overlap integrals obtained with the two sets of pseudopotentials 
from [8] and [34]. (/ , j ) means the overlap integral between the Bloch functions of the conduction 
bands i and j 



Fig. 11.2 Dispersion 
relations of the unprimed 
subbands for the relaxed 
(001) 5.4 nm thick Si film. 
The subband degeneracy is 
lifted for finite momentum in 
[110] direction 




0.05 0.1 

k[110][2n/a] 



consideration demonstrate that the two unprimed subbands with the same quan- 
tum number, which are completely equivalent in the effective mass approximation, 
develop quite a difference in energies for finite momentum in [110] direction. This 
result suggests that even the effective masses of the two unprimed subbands with 
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K [2n/ao] 

Fig. 11.3 Primed subband dispersion relations for a (001) unstressed 5.4 nm thick Si film in [001] 
direction. The third valley [8] splits for finite ky. It indicates that the third valley may originate 
from the two [100] primed subbands 



the same quantum number may be different, an observation that has never been 
mentioned before. 

Dispersion relations of primed subbands in a 5.4 nm (001) silicon film are shown 
in Fig. 1 1 .3 at A; 3, = and also for non-zero values of ky . For ky = Owe can clearly 
see the third valley with the minimum at the X-point [7, 8]. However, for ky ^ 
it clearly splits in two branches, with the energy of the lower branch decreasing 
while increasing ky and moving away from the Z -point in [100] direction. For 
ky = 0.15(27r/ao) and kx = 2n /uq the energy of the lower branch becomes 
equal to the energy of the unprimed subband at the minimum kx = O.S5(2jt/ao), 
ky = 0. A similar behavior is observed, when the value of ky is inverted to ky -^ 
—ky . This behavior indicates that the third valley originates from a pair of primed 
subbands with minima along [100] direction. Thus, one can conclude that the feature 
interpreted in [8] as the "third" valley is the cross section of the primed subband 
dispersions with the minima in [100] direction by the ky = plane. 

Next we consider an example of a (00 1 ) film stressed along [110] direction. Stress 
in [110] direction generates the shear strain component which causes a profound 
modification of the bulk dispersion [13, 27, 34]. Due to the shear strain component 
the degeneracy between the unprimed subbands with the same quantum number 



is lifted even at kx 



resulting in a large strain-induced valley splitting 



(Fig. 11.4). Substantial differences in the dispersion along [110] (solid lines) and 
[1 10] (dashed lines) directions indicate strong modification of the effective masses 
due to strain and reduced thickness [27]. Finally, we have carried the subband 
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Fig. 11.4 Strain-induced 
splitting between unprimed 
subbands is observed in a 
(001) 5.4 nm thick Si film. 
Strain dependence of the 
effective masses results in the 
difference between the 
subband dispersions in [110] 
and [110] directions 



0.25 



Fig. 11.5 Dispersion relation 
of hole subbands in a (001) 
relaxed Si film of 5.4 nm 
thickness obtained with the 
LCBB and the six-bands k-p 
method. Good agreement is 
achieved thanks to the 
inclusion of spin-orbit 
interaction into the EPM full 
band calculations 
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calculations for the valence band. The non-local pseudopotentials [21, 34] include 
spin-orbit interaction and thus the generalized LCBB method accurately describes 
the hole subbands. A comparison of subbands obtained with the LCBB method and 
the 6-bands k-p method is shown in Fig. 1 1.5. Spin-orbit interaction is included in 
LCBB leading to the formation of split-off subbands. 

Now we investigate in details the peculiarities of the electron subband structure 
by using the two-band k-p method. The main interest will be to understand the 
unprimed subband splitting induced by strain. 
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11.3 Unprimed Subbands in (001) Films: Analytical 
Consideration 



In this section we analyze the structure of unprimed subbands in thin (001) ori- 
ented silicon films based on the two-band k-p Hamiltonian (9.1). We perform an 
analytical analysis of the subband structure of the UTB silicon films under uni- 
axial strain. For simplicity the assumption of a square well potential with infinite 
potential walls is used. This assumption is only valid as long as the ground sub- 
band energy is much higher than the amplitude of the potential profile variation 
within the film. As soon as this assumption breaks down, a generalized numerical 
treatment of the k-p Schrodinger and Poisson equations is required. It is shown [3] 
that this numerical procedure can be implemented: the electron subband structure is 
obtained self-consistently thus completing the analysis of uniaxially strained UTB 
silicon films and inversion layers. 

The two-band k-p Hamiltonian of a [001] valley in the vicinity of the X -point of 
the Brillouin zone in Si must be in the form (9. 1) [4]: 



+ V{z)U + [2DE,y-—^y, 



(11.4) 

where ax,z are the Pauli matrices, 1 is the 2x2 unit matrix, /cq = 0.15 x 2n/aQ 
is the position of the valley minimum relative to the X-point in relaxed Si, kj with 
i e {x,y,z} is the wave vector, Sxy denotes the shear strain component in physics 
notations, M~^ = m~^ — tiIq^, and D = 14eV is the shear strain deformation 
potential [4, 13,28,34]. 

The confining potential V{z) along the [001] direction modulates the conduc- 
tion band profile. As long as the ground subband energy is much higher than the 
amplitude of the potential profile within the film, the confining potential in an 
ultra-thin silicon film can be approximated as square well potential with infinite 
walls (Fig. 1 1.6). This approximation is applied to single- as well as to double-gate 
structures. 

For a potential with infinite potential walls the wave function must be set to zero 
at the boundaries. This simplification allows an analytical analysis of the unprimed 
subband structure in (001) thin silicon films. 

Let us briefly outline the procedure to obtain the subband dispersion relations. 

In the two-band model the wave function is a spinor with two components: 

where a (k) and ii(k) are constants which depend on the wave vector k. Substitution 
of (1 1 .5) into (1 1 .4) and as assumption that the potential in the thin film is zero leads 
to the following eigenvalue problem: 
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Fig. 11.6 Potential in an 
ultra-thin SOI film of a 
single-gate MOSFET (left) 
and a corresponding model 
square well potential with 
infinite walls 
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(11.6) 



Taking the determinant of ( 11 .6) and setting it to zero results in the energy dispersion 
relation of the system: 



£'(k) = ^^^ + —^ — ± 



2mi 



2m, 



+ 



\ mi J 



(11.7) 



with 



8 = 



(. 



M 



(11.8) 



This is the same dispersion as (9.4), however, only discrete k- values are allowed 
in a thin film. To find these values and the subband quantization energies at fixed 
kx and ky, the off-diagonal term h^kxky/M leading to the conduction band non- 
parabolicity is included into the definition of S in (11.8). 

For each energy E there exist four solutions of k, (Fig. 1 1 .7). Figure 1 1 .7 shows 
that the dispersion ^(k) is even with respect to the X-point at k = 0. Therefore, 
from the four roots k. only two are independent. The missing two roots are obtained 
by alternating the signs of the values of the two independent roots ki and k2- The 
values of the roots are not always real. For non-zero shear strain the gap between 
the two conduction bands is opened at the X-point (Fig. 8.2). For energies within 
the gap the /c2 and — /c2 values are imaginary. 

The wave function at the energy i? is a superposition of the four eigenvectors 
with k~ being the solutions of the equation E = E(k,): 



nz) 



^[aiki)\ ik,.z (a 
\b{k,)J ^\b 



(-ki) 
H-ki) 



-ik\ 



■■+ 



a{k2) 
bik2y 



Jk2 



(a(-k2)\ 

\bi-k2))' 



-iko 



(11.9) 
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Fig. 11.7 Conduction band profile close to the X-point for jj = (solid lines), rj = 0.5 (dashed 
lines), and rj = 4 (dashed-dotted line) 



The subband quantization energies are obtained by setting to zero both compo- 
nents of the spinor at both film interfaces. This results in a system of four linear 
homogeneous equations for the coefficients in the linear combination. To find the 
dispersion equation we introduce the ratio c(k.) between b(k,) and aik^). c{k.) is 
an odd function with respect to k,. Here we assume that kx and ky are constants, so 
only k^ is preserved as an argument of the functions: 



c{k,) = 



b{kd 
a (k.) 



mi 



2mi 



^ + 



Imi 



+ 5 - E{k,) 



(11.10) 



The boundary conditions 



iA(z = ±r/2) = 



(11.11) 



for a symmetric confining potential demands that the solution satisfies a(—k^ 
:ka{k~). This leads to the wave function in the form 






(11.12) 
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After some simplifications, tlie two independent systems of equations are 
obtained: 

a{ki)cos{kit/2) + a(k2)co?,{k2t/2) = 0, (11.13) 

a{ki)c{ki)&m(kit /2) + a{k2)c{k2)^in{k2t /2) = 0, (11-14) 

<3(yti)sin(/cif/2) + a(fc2)sin(yt2f/2) = 0, (11.15) 

a(ki)c(ki) co&(kit /2) + a(k2)c(k2) cos{k2t /2) = 0. (11.16) 

Expressing a(k\) with (11.13) and (11.15) and putting them into (11.14) and 
(1 1.16) leads to these two conditions: 

tan(;cif/2) = ^^^ tan(yt2r/2), (11.17) 

c{ki) 

cot(A:if/2) = 4M cot{k2t/2). (11.18) 

c{ki) 

In case when the solution k2 is imaginary, the corresponding trigonometric 
functions are substituted with the hyperbolic ones: 

\.w(kit/2)=—-^ tanh(A:2?/2), (11.19) 

c{ki) 

cot{kit/2) = -—-^ coth(A;2f/2). (11.20) 

c{ki) 

After introducing the dimensionless variables defined as 

ki2 2h^kl E 28 

ko nil Eq a 

and a few calculation steps presented for convenience in the Appendix (1 1.9.1), the 
equations (11.18) can be written in the form: 

tan(Xiytor/2) = — ^ \^n{')( .k^i j 2) , (11.22) 

^1 ^±7^2 + XI 



X^ ri±Jri^ + Xl 
coi{Xikot/2) = -^ \ cot{X2kot/2). (11.23) 

For the numerical solution it is convenient to get rid of the trigonometric func- 
tions with singularities and to rewrite the equations as 
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s.m{Xikot/2)cos,{X2kot/2) = -^ ^ ■s,m{X2kQt /2)cos.{Xikot/2), 

rj ± ^n^+Xi 



Xi 



(11.24) 

X^ rj±J^f+xj 
cos{Xikot/2)sm{X2kot/2) = -^ \ ■ cos(X2kot/2)sm{Xikot/2). 



X 



1 r] ± Jr]^+X^ 



(11.25) 



The two variables Xi and X2 still coexist in these equations. Therefore, to find 
either of them, an extra relation to re-express Xi as a function of X2 or vice versa 
is needed. As demonstrated in Appendix(l 1.9.1), these relations are: 

X^ = X^ + 4 + 4^Xi + T]^, (11.26) 

Xi = X^ + 4-4y/xf + r]2. (11.27) 

The procedure for obtaining the energies of unprimed subbands in (001) thin 
films is as follows. First, one has to re-express one of the ^'s (say X2) via another 
(Xi) with help of (1 1.27) (or (1 1.26)) and solve the (1 1.24) and (1 1.25) Z as a func- 
tion of the parameter rj. Because of the trigonometric functions present in (1 1.24) 
and (1 1.25) there will be two infinite ladders of solutions as functions of tj corre- 
sponding to the unprimed subband ladders. The subband energy is computed by 
substituting the solution ki into (1 1.7). 



11.3.1 Dispersion Relations from an Auxiliary 
Tight-Binding Model 

The dispersion relations (1 1.22) and (1 1.22) were obtained from the two-band k-p 
theory (11.4). The k-p theories are usually applied, when the characteristic scale 
of the variation of envelope function is much less than the lattice constant. In con- 
trast, an atomistic tight-binding approach allows to obtain the whole Bloch function 
and is free from the limitations of the k-p method. Following Boykin [5], we intro- 
duce an auxiliary tight-binding model defined on a lattice of sites each containing 
two localized orbitals a(z) and y6(z). The Bloch functions u(z) and v{z) at the 
X-points corresponding to the two bands considered within the two-band k-p theory 
are expressed via the orbitals as 

m(2) = ^Q;(z-Ma) (11.28) 



2mi 2mt 
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C f,^k-ko 
^ m, 
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tt^kf h^(kl+kl) 
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and 

v(z) = ^i^{z-na), (11.29) 

n 

where a is the lattice constant, and n is an integer. The Bloch function for an 
arbitrary k^ counted with respect to the X-point is written in the form [16]: 

^{z,k,) = y^ exp{ik^na){a(kju(z — no) + ib(k,)v(z — no)). (11.30) 

n 

The coefficients a{k~) and b(k,) are determined by the two-band k-p model as: 

+ S-EJ \HkdJ 

(11.31) 

As shown in Fig. 11.7, for a particular energy E there exist two pairs of solutions 
of kz = ±/ci and K^ = ikj- The wave function for a finite array of A^ sites 
corresponding to the energy E is: 

N/2 

^±{z, E) = 2_. /_. Ci{e\p{ikina){a{ki)u{z — na) + ib{ki)v{z — na)) 

i = l,2n=-N/2 

+ exp(—ikina}{a(ki)u{z — na) — ib(ki)v{z — na))), 

(11.32) 

where C, is a constant. Taking into account strong on-site localization of the orbitals 
a and fi and their mutual orthonormality, the allowed values ofk\ or /c2 in the well 
with infinite potential walls are found from the condition that the wave function 
is zero at the film interfaces z = t/2. This results in two sets of equations (kj is 
assumed real) corresponding to the plus sign in (1 1 .32) [5]: 

aiki) cos(kit /2) + a{k2) cos{k2t /2) = 0, (11.33) 

biki) sinikit /2) + bik2) sm(k2t /2) = 0. (11.34) 

For the plus sign one obtains 

a{ki) sinikit /2) + aik2) sm{k2t /2) = 0, (11.35) 

b(ki)cosikit/2) + b(k2)cosik2t/2) = 0. (11.36) 

Equations (1 1.33), (1 1.34) result in the dispersion equations, correspondingly: 

,n w^^ a{k2)b(ki) 
coi{kit/2)= ————— cot(k2t/2) (11.37) 

a{ki)b(k2) 
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tan(/ci?/2) = — -— — — - tan(A;2f/2) (11.38) 

a{k^)b(k2) 

Interestingly, the dispersion equations (11.38) are the same as (11.18). The only 
condition used to obtain (1 1.38), was the smallness of the localization orbital radius 
as compared to the film thickness. Therefore, this condition determines the region 
of validity of the dispersion equations (11.18) and (1 1.38). 

Let us now reformulate (11 .24) and ( 1 1 .25) for the sum and the difference of Xi 
and X2. First we introduce the transformation rules for j„ and y„ as, 

A\ — A2 - ^1 ~\~ -^2 

yn = and yn = (11.39) 

or 

{yn+ynf = X^ and iy„-y„f = Xl (11.40) 

We only show the derivation for (1 1.24), due to the similarity with (1 1.25). Using 
the above given transformation and rewriting (1 1.24) to separate y„ and y„ leads to 
the following expression: 

sm{y„kot) + sm(y„kot) = ■ (-sin(y„kot) + sm{y„kot)) (11.41) 

c{Xi) 

Further simplification steps result in: 

. , , ,, ciX2)-c{Xi) . ,- , . r^^ A'^^ 

&m{ynkQt) = — — — sm{y„kot) (11.42) 

c{X2) + c(Xi) 

Now, as it is shown in Appendix(l 1.9.2), one can re-express _y„ as function of y„ 

The derivation of the fraction containing c{k\) and c(/c2) can also be found in 
Appendix (1 1.9.2). The resulting dispersion equation is finally in the form: 



y„ sin ( J - 



sHynkot) = ± V ; ==^ (11.44) 

This equation can be easily analyzed allowing to understand the role of shear strain 
and non-parabolicity on the structure of unprimed subbands in thin (001) silicon 
films. 
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11.4 Strain-Induced Valley Splitting 
11.4.1 Small Strain Values 

For zero stress the ratio on the right hand side of (11.44) is equal to zero, and 
the standard quantization condition q„ = jtn/ kt^t is recovered. This condition is 
obtained from either of the two equations, therefore, the subbands are twofold 
degenerate. For non-zero shear strain and/or kx ^ and ky ^ Q the parameter 
rj y^ 0. Due to the plus/minus sign in the right-hand side of (1 1.44), the equation 
splits into two non-equivalent branches for ?; 7^ 0. (1 1.44) is nonlinear and can be 
solved only numerically. However, for small r] the solution can be sought in the form 
j„ = ^„ ± ^, where ^ is small. Substituting y„ = q„ into the right-hand side of 
(11.44) and solving the equation by perturbations with respect to t,, we obtain for 
the difference between the energies of the two unprimed subbands with the same 
quantum number n : 

— Dsxykot 1/ " ■ (11.45) 



The two ladders of unprimed subband are conceived by the two [001] valleys, which 
are degenerate in the bulk. Interestingly, the degeneracy is lifted in thin films, when 
the dispersion in [001] direction is non-parabolic. This is why the energy splitting 
(1 1.45) is often referred to as valley splitting. 

The valley splitting is shown to be linear in strain for small shear strain values and 
to depend strongly on the film thickness [29] . In accordance with earlier publications 
[2, 5,8], the valley splitting is inversely proportional to the third power of ko and the 
third power of the film thickness t. Due to the cosine term in the right-hand side 
the value of the valley splitting oscillates with film thickness, in accordance with 
[5, 8]. In contrast to previous works, the subband splitting is proportional to the gap 
S at the X-point, (and not at the T^-point [2, 18, 19]) which depends strongly on 
shear strain. Thus the application of uniaxial [110] stress to [001] ultra-thin Si film 
generates a valley splitting proportional to strain. This effect can be used to generate 
controllably the valley splitting larger than the spin Zeeman energy, a fact which has 
not been yet appreciated in spin-related applications of silicon. 



11.4.2 High Values of Shear Strain 

For high strain values the dispersion (1 1.8) of the lowest conduction band become 
parabolic again (shown in Fig. 1 1 .7) and the quantization levels in a square well 
potential with a parabolic band must be recovered in this limit. We note that in the 
limit & ^ Eq X2 = '^J—y\ and equations (1 1.22) and (1 1.23) take the form [30]: 
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tan(Xi^o?/2) « ^^^ -^, (11.46) 

cot(J^ifcof/2) ^ ^^ 4=- (11-47) 

a(^i) v^ 

For large ?; (11.46) has the solution Xi = n(2n - l)/kot while (11.47) 
gives Xi = 2jrn/kot which results in the well-known quantization result Xi = 
nm/kot,m = 1,2,3,--- for subbands in an infinite potential square well with 
a single parabolic band. For the difference in energy AE„ between the two sub- 
bands with the same quantum number n (which are degenerate at ?] = 0) we get 
AEn = Ei{An — 1) in the limit of large r]. 



11.4.3 Numerical Solutions 

The numerical solutions were obtained in two ways. First, (11.22) and (11.23) are 

solved to obtain for the subband structures. The value X2 = J ^f + ^ — iJrj'^+Xf 
becomes imaginary at high strain values. A special care must be taken to choose 
the correct branch of Jx^ + yf in (1 1.22)/(1 1.23) or alternatively (1 1.24)/(1 1.25). 

The sign of v/^| + rf- must be alternated after the expression becomes zero, as it 
is displayed in Fig. 11.8. 

In order to be sure that the results are correct and to bypass the limitations of 
(11.22) and (11.23) which were obtained for a square well potential with infinite 
potential walls, a direct discretization of the differential operators in the Hamiltonian 
(11 .4) on a mesh, with the subsequent numerical solution of the corresponding sys- 
tem has been performed [3]. The later method allows to include the finiteness of the 
potential barrier at the interface of a film, to consider arbitrary film orientation, and 
to solve the Schrodinger equation self-consistently with the Poisson equation. The 
equation determined by the Hamiltonian (11.4) is elliptic, so no spurious solutions 
have been observed after discretization, and the solution of the eigenvalue problem 
is straightforward. Numerical results for (001) silicon films shown below obtained 
by both methods were equivalent. 

Figures 11.9 and 11.10 show the energies of the subbands as a function of shear 
strain for two different film thicknesses. Shear strain opens the gap between the 
two conduction bands at the X-point making the dispersions non-parabolic [28], 
which makes the (11.22) and (11.23) non-equivalent. This removes the subband 
degeneracy and introduces the valley splitting. Figures 11.11 and 11.12 show the 
energy difference between two unprimed subbands AEn as a function of strain for 
the same quantum number n . At unrealistically large strain values when the valley 
minimum is located at the X-point, and the dispersion, according to (1 1.7) becomes 
parabolic again, the well-known quantization conditions Xi = 7tm/kot,m = 
1, 2, 3, • ■ • are perfectly observed in Figs. 11.11 and 1 1.12. 
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Fig. 11.8 The right hand side of (1 1.22) and (1 1.23) plotted close to the point ^Jrf + Xl = 0. It 
is clearly seen that the sign of the square root must be alternated at this point 



Fig. 11.9 Subband 
quantization energies E„ 
(normalized to the ground 
subband energy) for a film 
thickness of 3.3 nm. The 
valley splitting appears for 
non-zero shear strain rj 
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Figure 11.13 shows the valley splitting for several film thicknesses as a func- 
tion of shear strain. In ultra-thin films already at moderate stress levels the splitting 
energy becomes larger than k^T. It is seen again that for small strain values the 
splitting is linear in shear strain. For large strain the quantization relations in an infi- 
nite square well potential with a single parabolic band are recovered resulting in the 
largest subband splitting. Uniaxial stress is currently used to enhance performance 
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Fig. 11.10 The same as in 
Fig. 1 1.9 for a film thickness 
6.5 nm. The valley splitting 
depends strongly on the film 
thickness. The valley splitting 
is maximal at high strain 
values 



Fig. 11.11 Difference of the 
subband quantization 
energies A E„ (normalized to 
the ground subband energy) 
from (1 1.24) and (1 1.25) for a 
film thickness of 3.3 nm. The 
valley splitting appears for 
non-zero shear strain rj 
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of modern MOSFETs, where it is introduced in a controllable way. Therefore, the 
valley splitting can be controlled by adjusting strain and thickness t. 



11.5 Effective Mass of the Unprimed Subbands 



In Fig. 11.14 the dependences of the transversal masses on shear strain in bulk sili- 
con (9.9), (9.10) (lines) and of the effective masses for the two lowest subbands with 
« = 1 (symbols) along [110] and [llO] in a film of a thickness t = 10.86 nm are 
compared. The transversal masses of the two ground subbands are in good agree- 
ment with (9.9), (9.10). However, when the film thickness decreases a substantial 
discrepancy between (9.9), (9.10) and the numerical solutions in films appears. 
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Fig. 11.12 /i£:„(j))forafilm 
thickness of 6.5 nm. The 
splitting depends strongly on 
the film thickness 



rig. 11.13 Splitting induced 
by shear strain for several 
film thicknesses is shown. As 
can be seen for ultra-thin 
films the splitting is already 
larger than kB T for moderate 
stress levels 
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especially at high values of strain, as shown in Fig. 11.15. Figure 11.16 shows that 
the curvature effective masses along [1 10] and [l lO] become different for decreas- 
ing film thickness even in relaxed films. The dependence of the effective masses of 
the two ground subbands without strain on film thickness is shown in Fig. 11.17. 
It does not of course mean that the subbands are parabolic, and the effective mass 
along [1 10] or [110] is computed as the inverse of the corresponding second deriva- 
tive. The non-parabolic subband dispersions are demonstrated in Figs. 11.18 and 
11.19. For a given kx and ky the subband with a lower energy has the equi-energy 
surfaces in the form of the unification of the two ellipses with the effective masses 
m\ and mj, while a subband with a higher energy has the intersection of the two 
ellipses as equi-energy surfaces. 

The difference between the bulk transversal masses (9.9), (9.10) and the subband 
masses in thin films is due to the rapidly growing value of the right-hand side in 
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Fig. 11.14 The dependences 
of the effective masses on 
shear strain for the two lowest 
subbands in a film of a 
thickness? = 10.86 nm 
(symbols). The dependences 
of the transversal masses on 
shear strain in bulk silicon 
(9.9), (9.10) are also shown 
(lines) 
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Fig. 11.15 The dependences of the effective masses on shear strain for the two lowest subbands 
in a film of a thickness / = 5.43 nm 



(11.44) with the film thickness decreased. The right-hand side is exactly zero in 
relaxed films, when either /c^ or ky is equal to zero. This is why the minimum of 
two ground subbands in relaxed films is at the same position kx = ky = 0. As 
soon as shear strain is non-zero or both kx ^ and ky ^ 0, the energies of the two 
ground subbands becomes different. 

In order to justify the dependences of the subband effective masses or, rather, 
of the second derivatives along [110] and [llO] on the film thickness let us solve 
(11.44) by perturbation assuming r] to be small. In the zeroth order the right-hand 
side can be ignored, and one finds y^ = qn = f^ ■ 
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Fig. 11.16 The dependences of the eifective masses on shear strain for the two lowest subbands 
for several film thicknesses 



Fig. 11.17 Ultra thin films 
exhibit different effective 
masses for the two ground 
subbands even without stress. 
The thinner the film the more 
pronounced is the difference 
in effective masses 
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Substituting y'^ = qn = yH ^'^'■° '^^ right-hand side of (11.44) and solving for 
small correction to jn , the dispersion relation for the unprimed subbands n can be 
obtained as 



Et = 



2m] 



\ t ) 2m, \kot) 






knt 



\kot) 



sm{kot). 

(11.48) 
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Fig. 11.18 For a film thickness of 1.36 nm the dispersion of the two ground subbands is shown. 
Without strain the subbands are degenerate at the minimum 



Fig. 11.19 The two ground 
subbands are shown as 
contour plots. The subband 
dispersion of one of them is 
described by the intersection 
of the two ellipses, while the 
other subband dispersion is 
given by the unification of the 
two ellipses 
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(11.48) confirms that the subband degeneracy is preserved only in the absence of 
shear strain and when either kx = or ky = 0. (1 1.48) shows that the unprimed 
subbands for thin films are not equivalent anymore, even in relaxed films. Deriving 
the expression for the effective masses out of (1 1 .48) reveals two independent effec- 
tive masses in [110] direction for the unprimed subbands with the same quantum 
number n without strain: 
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(11.49) 
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This behavior is in agreement with the effective masses found numerically for the 
two ground subbands in relaxed thin films shown in Fig. 11.17. 



11.6 Valley Splitting in Magnetic Field and Point Contacts 

In Sect. 1 1 .4 it was demonstrated that shear strain can be effectively used to increase 
the splitting between the valleys beyond the Zeeman spin splitting. This allows 
effective separation between the valley and spin quantum numbers [12], opening 
silicon to spintronic applications. Experimentally, the valley splitting is studied in 
silicon layers epitaxially grown on (001) SiGe substrates. Because of the lattice 
mismatch, the silicon layer experiences biaxial tension. Tensile biaxial stress partly 
removes the valley degeneracy as it was described in Chap. 9. Experiments show that 
the degeneracy between the two remaining valleys in silicon layers is also removed. 
Existing experimental results for silicon quantum wells can be separated in two 
groups by the methods used to obtained the data. 

• Measurements of the valley splitting in an external magnetic field in the sili- 
con quantum wells grown on SiGe substrates. Due to a mismatch in the lattice 
constants between silicon and SiGe a biaxial tensile stress is created in a (100) 
silicon quantum well. Due to size quantization in the well, the sixfold degen- 
eracy between the bulk conduction band valleys is enhanced. The remaining 
degeneracy between the two lowest unprimed subbands is lifted by applying a 
perpendicular magnetic field [12, 14, 15]. The corresponding splitting deduced 
from the Shubnikov-de-Haas oscillations using a wide Hall bar geometry shows 
a strikingly linear dependence on the magnetic field [12]. This linear dependence 
previously observed at high fields was recently confirmed to exist down to values 
of the magnetic field as small as 0.3 T [12]. The valley or, more correctly, the 
lowest subband splitting is in the range of a few /^eV for small fields [12, 14, 15]. 
This value obtained from experiments is much smaller than the one predicted 
theoretically [5]. 

• The second set of experiments on valley splitting is based on the conductiv- 
ity measurements through a constriction or a point contact. The constriction is 
created by applying a negative bias to the gates patterned on top of the two- 
dimensional electron gas in a quantum well. The potential well gets depleted 
under the gates, which leads to the formation of a point-like contact between 
the two-dimensional electron gas reservoirs. The conductivity of a point con- 
tact is quantized in units of e^ /h at low temperature. The conductivity steps 
correspond to transversal channels and, therefore, can be labeled by valley, sub- 
band, and spin quantum numbers. By carefully analyzing these steps it was first 
demonstrated in [31] that the valley splitting can be a few meV without external 
magnetic field. This value is larger than typical spin Zeeman splitting. Recently, 
a similar study was reported in [12], where a valley splitting of several meV 
was confirmed. It was also observed that the valley splitting can be controlled by 
additional confinement in nanostructures. The additional confinement is achieved 
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by either changing the gate voltage or by an external magnetic field, showing 
that the valley splitting can be controlled through both physical and magnetic 
confinement. 

The valley splitting has been known from the mid sixties [10] and has been the 
subject of investigations since. A comprehensive overview of the earlier theoretical 
models of the electric break-through at the 7^ -point by Ohkawa and Uemura [18,19] 
and the surface scattering theory by Sham and Nakayama [22] is given in the famous 
review of Ando et al. [2]. In the Ohkawa- Uemura theory the intervalley coupling is 
induced by the conduction band bending at the F -point. In the Sham-Nakayama 
theory the coupling is due to possible electron transition from one valley to another 
as a result of electron scattering at the abrupt interface. These two models are con- 
sidered to be not completely independent and are related due to similar expressions 
for the predicted valley splitting [2]. Recently, the theory of intervalley coupling 
has been extended [11] to explain experimental data [12]. Inspired by a first prin- 
ciples theory [9], intervalley coupling was introduced phenomenologically at the 
heterostructure interface. The strength of the intervalley interaction parameter can 
be calibrated to reproduce results from a tight-binding model [5]. 

The most striking feature found experimentally [12] is the difference in scales 
between the two values for the valley splitting obtained from the two experimental 
setups described in the previous section. In order to make the theoretical predictions 
qualitatively consistent with the experimental data, a quantum well with an orienta- 
tion slightly tilted from the (001) orientation was considered in [11]. The quantum 
well developes atomic steps at the interfaces associated with the global tilt [11]. The 
surface disorder is due to a slight misalignment of the wafer from (001) direction. 
For a strong field the magnetic length becomes smaller than the typical thickness 
variation length which results in large valley splitting. According to [12], this is also 
the reason why in a laterally confined electron system of quantum point contacts 
the valley splitting is large. At week magnetic fields the magnetic length extends 
to many steps of thickness variations. An electron is then effectively moving in a 
quantum well slightly tilted from the (001) orientation, where the valley splitting is 
suppressed [12]. In order to recover an experimentally observed linear dependence 
of the valley splitting on the magnetic field strength a particular disorder model for 
step profiles must be adopted [11]. 

An advantage of the two-band k-p model (11.4) is that it provides the value of 
the valley splitting in a confined system and allows to calculate intervalley coupling 
without introducing any additional fitting parameters. As it is shown below, a linear 
dependence of the valley splitting on magnetic field can be obtained from the two- 
band k-p model alone. The model also explains a larger value of the valley splitting 
in a [1 10] oriented point contact. 
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11.6.1 Valley Splitting in Magnetic Fields 

In relaxed (001) silicon films the Landau levels in an orthogonal magnetic field B 
are found from (11 .48) by using the Bohr-Sommerfeld quantization condition: 

E^^^^ = hcoc ( m + -] ^ (11.50) 

V 2/ 4arctan(yOT(i,2)/w(2,i)) 

where the masses are determined by (1 1.49), and 

eB 



^m\m2C 

is the cyclotron frequency, e is the electron charge, and c is the speed of light. 
According to (11.50), the difference \Em — Em \ is linear regarding the magnetic 
field. In Shubnikov-de-Haas experiments there will now occur two sets of resistance 
oscillations with slightly different periods the in inverse magnetic field. Because of 
the small difference between the masses the difference in the periods will also be 
small. However, at the Fermi level the quantum number m, which is proportional to 
the ratio of the Fermi energy to the cyclotron frequency, is typically very large and 
may lead to a splitting of several hundreds /xeV. The difference in the periods can 
be interpreted as an appearance of an additional energy shift between the equivalent 
unprimed valleys. Most importantly, the shift is linear in the magnetic field. The 
linear dependence of splitting between the valleys on the magnetic field will be also 
observed even in the presence of a small intrinsic constant valley splitting as long 
as this splitting is much smaller than the Fermi energy. This splitting is possible due 
to a remaining shear strain and/or conduction band non-parabolicity which is not 
accounted for in the two-band k-p theory (1 1 .4) and is usually several tens of /xeV, 
thus much smaller than the Fermi energy. For a lOnm thick silicon film grown on 
SiGe it follows from (11 .50) that the valley splitting can be several tens of /xeV in a 
magnetic field of 1 T, which is consistent with the experimental observations [12]. 



11.6.2 Valley Splitting in a Point Contact 

We consider a point contact in [110] direction realized by confining an electron 
system of a thin silicon film laterally by depleting the area under additional gates. 
Without strain the low-energy effective Hamiltonian in the point contact can be 
written as: 

/^(1,2) = Z — + - '— + -Kx'^ + Vb (11.51) 

2m(i,2) 2m(2,i) 2 

where the primed variables are along the [110] and [110] axes, the effective masses 
are determined by ( 1 1 .49), k is the spring constant of the point contact confinement 
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potential in [110] direction, and Vj, is a gate voltage dependent conduction band 
shift in the point contact [36]. The dispersion relation of propagating modes within 
the point contact is written 



^(1,2) 



tl^k'^ 



2m 
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+ Vb 



(11.52) 



where ft)(i,2) = k/^(\,2)- Since the energy minima of the two propagating modes 
with the same p are separated, they are resolved in the conductance experiment 
through the point contact as two distinct steps. The valley splitting is AEp = 
hp\a\ — (1)2 1- The difference in the effective masses (11.49) and, correspondingly, 
the valley splitting can be greatly enhanced by reducing the effective thickness t of 
the quasi-two-dimensional electron gas, which is usually the case in a gated electron 
system, when the inversion layer is formed. 

In a [100] oriented point contact without strain the effective Hamiltonian is 
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(11.53) 



Due to symmetry with respect to ky the subband minima in a point contact are 
always degenerate. For this reason the valley splitting in [100] oriented point 
contacts is greatly reduced as compared to point contacts in [110] direction. 



11.7 Primed Subbands in Ultra-Thin (001) Silicon Films 

The shear strain component in the [110] direction does not affect the primed valleys 
along [100] and [010] direction, except for a small irrelevant minimum displacement 
from the [100] or [010] axes [1] due to the term (8.40) in the Hamiltonian. Interest- 
ingly, recent Linear Combination of Bulk Bands calculations [23] and calculations 
based on the density-functional theory [17] uncover the dependence of the transver- 
sal effective mass mt of the primed subbands on the silicon film thickness t. Here 
we analyze the dependence of the primed subbands effective mass via the two-band 
k-p Hamiltonian (11 .4). In the case of the primed subbands in (001) silicon films the 
quantization is done along the kx (or ky) direction. In the Hamiltonian 
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The quantization is along the x-axis, so the substitution kx = —ifid/dx must be 
performed, while the other two components ky and k. are set to be constants. 

The Hamiltonian (11.54) has a structure similar to (11.4): it contains the first 
and the second derivatives. The substantial difference is that the first derivative 
ky/M{—ihd/dx) is multiplied by the variable ky, contrary to the corresponding 
first-order term in (11.4) ko/mi{—ifid/dz), where ko = 0A527t/ao is a constant. 
Let us approximate the confining potential U(x) by a square well potential with 
infinite walls. Because the structure of the Hamiltonians (1 1.54) and (1 1.4) and thus 
the dispersions are similar (in fact, the dispersion is described by the same relation 
(9.4)), one can claim that for any energy E there will be two pairs of the solutions 
for kx which we will also call here ±ki and ±^2- Repeating the calculations pre- 
sented in Appendix 11.9 the relation between the two pairs of roots can be written 
in the form: 



kl(ki) _ k\ kl mj _ Ikl kj m? k} m^ 
The dispersion equations are in the form: 



(11.55) 
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where the functions fpm(k\) are defined as: 



(11.56) 
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After the ladder of solutions k"{ky,k,) is found, the energy of the 
subbands is computed as 



(11.57) 
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(11.58) 



11. 7.1 Effective Mass of Primed Subbands 



We solve the dispersion (1 1.55)-(1 1.58) numerically. Alternatively, like in the case 
of unprimed subbands, we discretize the differential operators kx = —id/dx in 
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Fig. 11.20 Comparison 
between the effective masses 
for the lowest ground 
subband obtained with 
fuUband LCBB calculations 
[23] {filled symbols), 
calculations from [17] (open 
symbols) and the two band 
k-p model {solid line). 
Excellent agreement with the 
full band calculations are 
achieved 
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tlie Hamiltonian (11.48) on a mesh in the x direction and solve the eigenvalue 
problem numerically. Both approaches yield the same results. The dependence of 
the transversal masses of primed subbands in (001) relaxed silicon film is shown 
in Fig. 1 1 .20. The predictions of the two-band k-p theory are in excellent agreement 
with the results of the Linear Combination of the Bulk Bands method with a poten- 
tial barrier of 3 eV at the film interface [23] and are also consistent with the DFT 
calculations from [17]. 



11.8 Substrate Orientations Different from (001) 



There is growing experimental evidence [32,33] that the parabolic approximation is 
not sufficient to accurately describe the subband parameters (effective masses and 
subband energies) in ultra-thin films under uniaxial stress. For instance, the effective 
masses of the unprimed subbands in a (001) film depend on shear strain [33] and 
silicon film thickness [26]. These effects are ignored in the parabolic band approxi- 
mation but well described by the two-band k-p model (9.1)-(9.3) of the conduction 
band as it was demonstrated above. 

We are now going to demonstrate that the two-band k-p model (9. l)-(9.3) allows 
an accurate description of the dependences of the subband energies and effective 
masses in thin silicon films with orientations different from (001). 
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11.8.1 Rotation of the Hamiltonian 



To describe subbands in a film with an arbitrary crystal orientation a proper rotation 
of the Hamiltonian is required. Assuming e,, to be the quantization direction in the 
crystallographic coordinate system, let us define a unitary transformation Uc which 
transforms the quantization direction into Cz = (0, 0, 1)^ 



Cz = U„ ^e„. 



(11.59) 



Rotation of the wave vector k i-^ Uck with k = {kx , ky , k-y is required. When 
the direction of the quantization direction en in the crystallographic system is 
determined by the spherical angles (p and ??, the transformation from the crystal 
coordinate system to the coordinate system is written as 



Ue 



cos((p) cos(!?) — sin((p) cos(^) sin(i?) 

sin(^) cos()?) cos((p) cos(^) sin(!?) 

-sin(!?) cos(!?) 



Furthermore, the different valley types along y = x, y, z axes need to be considered. 
Using the transformations 
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the total transformation matrix for valley v is given by [3] 

U„ = Uv,„Uc. (11.60) 

To take strain into account for each valley type, the strain tensor is transformed by 

£h^Uj„eUv,„. (11.61) 

Therefore, the diagonal blocks of the two-band k-p Hamiltonian (11.4) for arbitrary 
substrate orientation and valley type v are rewritten as 
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The off-diagonal coupling components are transformed 



11.8 Substrate Orientations Different from (001) 



159 



H, 



off,v 



De 



xy 



k^u! 





M 


M ^ 











U„k. 



To find subbands, the replacement k, —>■ — i d^ is introduced. The discretization 
is realized with a finite difference scheme with hard wall boundary conditions. The 
resulting eigenvalue problem gives rise to discrete energies describing the subband 
structure. 



11.8.2 Thin (110) Oriented Silicon Films 

To describe the subbands in a film with (110) crystal orientation we appropriately 
rotate the Hamiltonian for each pair of the valleys and resolve the subband structure 
numerically. The zero boundary conditions for the wave functions at the interfaces 
are applied. 

Figure 11.21 demonstrates the thickness dependence of the effective mass of 
the ground unprimed subbands of (110) films in the [010] direction. Results are in 
agreement with those obtained by the pseudo-potential method [23]. The effective 
mass increase indicates that transport properties in ultra-thin (110) films in [010] 
direction are degraded with decreasing thickness t . 

Figure 1 1 .22 shows the subband energy dependence of the film thickness for the 
lowest primed and unprimed subbands. Surprisingly, in ultra- thin (110) films the 
twofold degenerate subband becomes lower in energy than the fourfold degener- 
ate one. This is a direct manifestation of the increased non-parabolicity [3] of the 



Fig. 11.21 Effective mass in 
[010] direction dependence 
on thickness for the fourfold 
degenerate ground subband in 
a (1 10) film. The results 
obtained with help of k-p 
model are compared against 
the results of pseudo-potential 
calculations [23] 
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Fig. 11.22 Dependence of 
the primed and unprimed 
subband energies on the film 
thickness ( in (1 10) films. 
Analytical results (1 1.62) and 
(1 1.63) are shown by lines 
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[001] valleys in the [110] quantization direction [32]. The twofold degenerate sub- 
bands are characterized by the lighter mass w, in [110] direction. Thus, the [110] 
channel direction becomes beneficial in FinFETs with ultra- thin bodies. Analytical 
results for the twofold degenerate subbands also shown in Fig. 1 1.22 are obtained 
by substituting kx by its quantized values kx = Tin/t in the dispersion 



l^n\K-y t l^z) 







(11.62) 



where, as before, ko = 0.152:/r/ao, Sxy denotes the shear strain component in the 
crystallographic system, and D = 14eV is the shear strain deformation potential. 
The analytical expression for the fourfold degenerate subbands are obtained with 



l^nKi^y^ i^z) — 
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(11.63) 



with the corresponding quantization mass [24]. Figure 11.23 demonstrates that 
(1 1.62) describes accurately not only the minimum position but also the dispersion 
in [110] {ky) direction of the twofold degenerate unprimed subbands in a f = 3 nm 
thin relaxed silicon film. 

Tensile uniaxial stress along the channel enhances electron transport in [110] 
direction. In a (110) thin film the uniaxial stress favors to altering the relative 
energy order of primed and unprimed subband ladders. Due to an increase of the 
quantization mass nix in the [110] direction for increased tensile strain and higher 
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Fig. 11.23 Twofold degenerate subband dispersion in the [110] direction for a (110) film with 
f = 3 nm. Lines are obtained with (1 1.62) 
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Fig. 11.24 Crossing between the first primed and unprimed subbands in a ; = 3 nm (110) film as 
a function of uniaxial stress in [1 10] direction 



non-parabolicity at reduced thiclcnesses this transition happens at low stress val- 
ues as compared to the case when these effects are ignored by setting l/M = 0, 
as shown in Fig. 11.24. The effective mass in the transport direction is, how- 
ever, reduced with increased stress for decreased film thickness t as demonstrated 
in Fig. 11.25, which results in higher mobility. Thus, we conclude that FinFETs 
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Fig. 11.25 Dependence of 
the transport effective mass in 
the first twofold degenerate 
subbands on tensile stress 
along the [110] direction, for 
several thicknesses of (1 10) 
films 
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with ultra-thin body tensely stressed along the [110] channel have superior trans- 
port characteristics over [010] FinFETs because of the subband ladders inversion 
and a lighter effective mass of the lowest subband [25]. This behavior was recently 
observed experimentally [6]. 



11.9 Appendix 



11.9.1 Re-Expressing Xi as a Function of X2 



Following dimensionless units are introduced: 



ko "" 

E s 

Eo ' ^0 



(11.64) 



The energy dispersion (1 1.7) takes the form: 



X^ I mi(kl + kl) 
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(11.65) 



Setting the determinant of the dimensionless Hamiltonian to zero gives X as a 
function of energy e: 
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(11.66) 



or [\-^) -X^-n^ = 0. 



Introducing 



^ = X^, (11.67) 

allows to re-express the fourth order equation (1 1.66) as a second order equation 
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The solution are 



^-^s + s^-^-r]^ = (11.68) 



^ = 2(1 + e) ± V'4(l + e)2-4(e2-?72) or 

^ = 2(1 + £) ± 2/rT2eT^. (11.69) 



The solutions can also be written as 

,2 



^= (li ^/l + 2s + riA -rf- (11.70) 

Thus, for X values one gets the following expressions: 

Xl = (\ + ^\ + 2e+ri^^ -rf- (11.71) 

Xl = (\-^fV+7l^^^ -rf. (11.72) 

Using the following identities 

J^2 I j^2 

' ^ ^ =2(l + £), (11.73) 

^ ^ =lsl\ + ls+rf (11.74) 

results in the desired expressions X\{X-i) and X^iXx): 

Xl=i^^llzJb\ _rf^ (11-75) 

Xl=\\ ^ ~ ^ I -??2 or (11.76) 
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Xf = Xi + 4 + 4^X^ + r]^, (11.77) 

Xi = X^ + 4-4^Xf + ri^. (11.78) 

The corresponding expression for the function c{X) in dimensionless units is 

c{X) = , (11.79) 

r]±y/r]^ + X^ 



11.9.2 Expressing the Dispersion Equations 
in Terms of Xi ± Xj 

Let us define the following variables: 

Xi + X2 

yn= \ , (11.80) 



Xi -X2 

yn= \ ■ (11.81) 



By using (11.75), (11.76) one obtains: 



1± \ J ' ^^^-^2) 



which is rewritten as 



It then follows that 
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Thus, y„ is a function of j„ . Similar, _y„ is a function of jn : 
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Let US express the fraction F = cix )~+c(x ) ™^ (1 1-42) in dimensionless units: 






F= ) { ) : ■ (11.89) 



Substituting Xi with (11.75) and X2 with (11 .76) and using the identity JxfJ+^= 
|i± ^4 '^ \=\\±ynyn\ one rewrites F as a function of jn and j^: 

p _ {yn- yn) (>7 ± (1 + y»y»)) - jyn + jTi) (?? ± (1 - ynyn)) (1190) 

(j« - Jn) (/? ± (1 + Jnjn)) + (j„ + >>„) (?? ± (1 - J^Jn)) ' 

Further simplifications result in: 

v„ (n ± 1 T J^) 
j„ (^ ± 1 T j2) 

Substituting (11.87) into (11.91) leads to 

F= ^^ ^"^ . (11.92) 
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which gives the corresponding term in (1 1 .44): 



1 = ^^^ ^"^ ^. (11.93) 
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Chapter 12 

Demands of Transport Modeling in Advanced 

MOSFETs 



12.1 TCAD Tools: Technological Motivation and General 
Outlook 

Integrated circuits (IC) play a key role in modern digital information society. 
Superior computational performance is achieved by making transistor faster and 
assembling more and more elements on a chip. This is achieved by scaling the 
MOSFET size down. In the past decade the minimum feature size of transistor has 
been successfully reduced which allowed to double the number of transistors on a 
chip every second year. This trend is expected to continue in the next decade, as 
predicted and institutionalized by the International Technology Roadmap for Semi- 
conductors [1] and supported by demonstration of MOSFETs with the gate length 
as short as 6 nm [28]. 

The success of microelectronics technology is partly enabled and supported by 
sophisticated Technology Computer-Aided Design (TCAD) tools which are used 
to assist in IC development and engineering at practically all stages from process 
definition to circuit optimization. At the current moment, the TCAD tools allows to 
reduce research and development costs by approximately 35^0% [1]. Most TCAD 
tools are based on semi-classical macroscopic transport models. From an engineer- 
ing point of view, semi-classical models, like the drift-diffusion transport model, 
have enjoyed an amazing success due to their relative simplicity, numerical robust- 
ness, and the ability to perform two- and three-dimensional simulations on large 
unstructured meshes [150]. However, with device size dramatically reduced and new 
technology elements and materials introduced, the TCAD tools based on a standard 
semi-classical transport description begin to show shortcomings. 

From the viewpoint of transport modeling, the problem is twofold. First, with 
the downscaling the driving field and its gradient increase dramatically in the 
short channel. As a result the carrier distribution along the channel can no longer 
be described even by the shifted and heated Maxwellian distribution. In order to 
properly account for hot-carrier and non-local effects, the drift-diffusion and even 
the energy transport model have to be improved to incorporate the substantial 
modifications in the distribution function. 
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The second reason for semi-classical modeling tools to gradually loose their 
validity lies in the particle-wave duality of carriers. The carrier motion can be 
described with the classical Newton law only, when the characteristic size of the 
device is much larger than the corresponding de-Broglie electron wave length. When 
the device dimensions are getting comparable to the carrier wave length, the carri- 
ers can no longer be treated as classical point-like particles, and effects originating 
from the quantum-mechanical nature of propagation begin to determine transport in 
ultra-scaled devices. 

A typical example of quantum effects in MOSFETs is the quantization of carrier 
motion in the potential well of the inversion layer at the silicon/dielectric inter- 
face. The size of the channel is usually in the order of a few nanometers, which 
is also the size of the electron de-Broglie wave length. In this confining potential 
only levels with certain, quantized energies are allowed. This results in the forma- 
tion of subbands, as it was shown in Sect. 1 1 and, therefore, transport can no longer 
be accurately described by the classical equations in three dimensions, and a new 
description based on two-dimensional subbands must be adopted. 

Another quantum-mechanical effect in modem MOSFETs is the tunneling of 
charge carriers through classically forbidden regions. Tunneling leads to increas- 
ing leakage through thin gate dielectrics. This gate leakage is responsible for an 
increased power consumption in modern MOSFETs. In 65 nm MOSFETs the gate 
dielectrics are already so thin that the leakage current leads to critically high values 
of generated power. The conventional thinning down of gate dielectrics becomes 
no longer possible, and new technological solutions must be introduced to continue 
scaling and keeping the heat generation within reasonable limits. The use of alter- 
native gate oxide materials with higher permittivity (high-k dielectrics) and metal 
gates [119, 122, 132] Natarajan, S. helps significantly reducing the gate leakage 
current and allowed to successfully continuing scaling to the 45 nm and the 32 nm 
nodes. Strain introduced at 90 nm [ 1 76, 1 77] as an important MOSFET drive current 
and performance booster is perfectly compatible with the new high-k/metal gate 
technology [122, 132]. 

Several options for future technology nodes are currently under extensive investi- 
gation. One of them is multi-gate FinFET device architectures with improved chan- 
nel control and reduced short-channel effects can eventually be employed [129]. 
Another option is to replace silicon in the channel with an advanced material char- 
acterized by substantially higher mobility [142, 143]. Both options are compatible 
with high-k/metal gate technology and strain-induced current boost. 

Tunneling is not only affecting the gate leakage currents. In ultra-scaled 
MOSFETs with gate lengths of lOnm and smaller quantum-mechanical tunneling 
under the barrier in the transport direction from source to drain becomes impor- 
tant [170]. When carrier scattering is neglected, a coherent transport approach based 
on the solution of the Schrodinger equation provides an accurate description for 
quantum-mechanical transport. This method is fast and efficient and can be imple- 
mented into commercial TCAD tools. However, since all devices operate at room 
temperature, carrier scattering in silicon-based FETs is still important, and trans- 
port is significantly affected by scattering even in ultra-scaled FETs [134]. Recent 
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Studies indicate that the crossover from diffusive to ballistic transport in nano-wire 
transistors may occur at a much shorter distance than previously anticipated [54]. 
An adequate transport model for ultra-scaled MOSFETs must therefore account for 
quantum-mechanical coherent and dissipative scattering effects simultaneously. 

Modern TCAD tools have to be flexible enough to address challenges due 
to upcoming technological changes resulting from the use of new materials and 
structures. They have to describe properly transport in silicon and new advanced 
materials depending on strain and must be prepared to adequately describe the 
new quantum-mechanical phenomena which will determine transport in ultra-scaled 
CMOS and post-CMOS devices. 



12.1.1 Brief History of TCAD Transport Modeling 

The first fully numerical transport description was already suggested in 1964 by 
Gummel [64] for the one-dimensional bipolar transistor. The approach was fur- 
ther developed and applied to pn junctions [24] and to impact ionization avalanche 
transit- time diodes by Scharfetter and Gummel [148]. The first application of a solu- 
tion of the two-dimensional Poisson equation to metal-oxide-semiconductor (MOS) 
structures was performed by Loeb [108] and Schroeder and Muller [149]. The 
first simultaneous solutions of coupled continuity and Poisson equations applied to 
junction field effect transistors [84] and to bipolar transistors [159] go back to 1969. 

Since these pioneering works on transport modeling many different approaches 
have been applied to practically all important devices, and the number of papers in 
the field has grown exponentially. Today the modeling of transport in modern ICs 
has matured into a well established field with active research, intensive software 
development, and vast commercial applications. Many textbooks, monographs, and 
reviews devoted to theoretical and computational aspects of transport modeling 
in ICs have been published. Probably the first monograph covering practically all 
aspects from modeling and discretization to applications is [150]. Not pretending to 
cover all the literature we mention only the most recent monographs relevant to the 
review. Various transport models were summarized in [113], while new approaches 
to transport are well described in [22]. 

As the costs of development and maintenance of modern sophisticated TCAD 
software have significantly increased, only few large semiconductor companies can 
afford to support their own TCAD development team. There is a fairly large number 
of commercial TCAD software products available on the market [158, 174] which 
serve most of the industrial demands. Numerous TCAD tools developed at uni- 
versities have the advantage that they are freely distributed software [2,70, 179]. 
These tools regain their popularity due to the concept of a complex approach to 
simulations based on different levels of precision and complexity. It makes these 
tools valuable not only for pure educational and research purposes. Semiconduc- 
tor manufacturing companies which need more refined simulations of complex 
phenomena often consult with researchers from universities to verify their data 
obtained experimentally. 
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12.1.2 Transport Modeling: Formulation of the Problem 

In order to analyze a semiconductor device under general operating conditions, a 
mathematical model has to be formulated first. Regardless of the complexity of car- 
rier dynamics inside the semiconductor, two equations are an important part of any 
model and always have to be included. The Poisson equation relates the density 
p{r, t) of charged carriers in the device to the electrostatic potential <p{r, t): 

div{Kgmd4>ir,t)) = -p(rj), (12.1) 

where ic is the dielectric permittivity. The Poisson equation must be supplemented 
by the proper boundary conditions, which usually are of the Dirichlet or Neumann 
type. The Dirichlet boundary condition fixes the potential at the domain boundary 
and is conveniently applied to model the interface between an external electrical 
contact and the device. The Neumann boundary condition sets the normal compo- 
nent of the electric field at the interface E„ = — (n, grad^(r)) to zero in order to 
isolate the simulation domain from the dielectric environment. 

The second equation is the continuity equation, relating the current density j(r, t) 
to the time derivative of the charge density: 

9p(r, t) 

-^^=divj(r,r). (12.2) 

at 

In semiconductors two types of particles carrying positive (holes) and negative 
(electrons) charges are present. By introducing the electron «(r, t) and hole p{T, t) 
concentrations, the continuity equation can be written for each carrier type as: 



9«(r,f) 
-q — - — = divie(r.t)-qR(r,t), 

dp(r, t) 
+q 3^ = divjp(r,f) + qR{r,t), 



(12.3) 



where R(r, t) is the electron-hole generation rate. 

In order to form a closed set of equations and complete formulation of the mathe- 
matical model, the system of (12.1) and (12.3) must be supplemented by the material 
properties relating the current density j(r, t) to the electron «(r, t) and hole p{r, t) 
concentrations and the electric field E(r, t) = — grad<^(r, t). Derivation of these 
relations is a challenging task which requires an accurate consideration of physical 
mechanisms responsible for carrier motion inside the device. 

Although we do not have the intention to completely cover this enormously 
large field of research here, we present several important examples and outline 
some difficulties and challenges in transport description in modern MOSFETs with 
strain. We begin with the semi-classical description of carrier dynamics inside the 
device, which is justified when the characteristic device size is much larger than the 
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corresponding electron wave length. In ultra-scaled devices quantum effects start 
playing an important role. Different types of quantum potential and density gra- 
dient corrections can be introduced. A more refined description treats the carriers 
motion quantum-mechanical in the confinement direction resulting in the subband 
description of carriers in surface layers and single- and double-gate structures. 
Since the channel length in MOSFETs is large compared to the scattering length, 
a semi-classical transport description can be applied the current direction. How- 
ever, transport descriptions based on a fully quantum-mechanical approach dealing 
with the dissipative quantum transport will be needed in ultra-scaled devices. Begin- 
ning with the description of ballistic coherent transport, the methods based on 
the Non-Equilibrium Green's Function (NEGF), on the density matrix and the 
Wigner function to include scattering are briefly discussed. For quantum devices 
it is sometimes easier to compute the total current without resorting to the continu- 
ity equations (12.3), however, for any transport model, classical or quantum, (12.3) 
must always be satisfied. A particular emphasis will be put on inclusion of strain 
effects into the transport simulations. 



12.2 Semi-Classical Transport 

The system of an ensemble of classical particles is conveniently described by the 
single-particle distribution function /(r, k, f ) in phase space, formed by position r 
and momentum (^k). It can be shown [82] that in case of slow varying perturbations 
in space and time the distribution function satisfies the Boltzmann equation [150]: 

'^+v.V./+^E.V./=(|)^ , (12.4) 



dt " fi 



'coll 



Here, v = Vki?(A:) is the carrier velocity, m is the effective mass. To avoid the 
confusion with the electric field E, the dispersion is denoted as S' = E{k) = 
h^k^ I {2m) in this section. The sign function .Si; distinguishes between negatively 
charged electrons .y^ = —1 and positive holes, 5^, = 1. The right-hand side in (12.4) 
represents the collision operator due to phonons, impurities, interfaces, and other 
scattering sources. 

For realistic structures, a direct numerical solution of this equation by discretiza- 
tion of the phase space is computationally expensive. This is why the TCAD tools do 
not usually solve the Boltzmann equation and are based on simplified transport mod- 
els. Approximate solutions can be obtains by using the method of moments of the 
distribution function. Defining the moments of the distribution function /(r, k, t), 
one consecutively obtained to the drift-diffusion model [64], the energy-transport 
models [164], or the six-moments transport model [58]. Transport models based on 
the moments of the Boltzmann equation are well accepted in TCAD. 

Transport models uses several material characteristics as input parameters. For 
the drift-diffusion model the mobility represents the most relevant parameter [150] 
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relating the drift velocity to the driving force. Mobility dependence on material, 
structure, and physical effects, like the remote Coulomb scattering or soft-phonon 
scattering at the interface of high-^ materials, must be quantified. Demands on 
mobility modeling also arise from currently investigated effects to increase tran- 
sistor on-current. Such effects are for instance mobility enhancement in strained 
silicon channels, possibly in combination with a favorable substrate orientation and 
channel orientation. 

TCAD tools are using simplified transport models, because solving the Boltz- 
mann equation with a Monte Carlo technique is computationally very expensive. 

Although computationally expensive to obtain the device current-voltage char- 
acteristics, a solution of the Boltzmann equation by Monte Carlo methods is conve- 
nient for calibration of the transport models based on the moments of the distribution 
function. In particular, a Monte Carlo solution of the Boltzmann equation is neces- 
sary to obtain the accurate closure relations, expressing higher moments via the 
moments of lower order. The solution of the Boltzmann equation with the Monte 
Carlo technique is also useful, because it allows to introduce and control scatter- 
ing mechanisms at the microscopic level and, most importantly, to incorporate the 
peculiarities of the semiconductor band structure modification with strain. 

Although the drift-diffusion and energy-transport models have been used for a 
long time, calibration of the model parameters to incorporate strain is needed. With 
down-scaling of device feature sizes the drift-diffusion and even energy-transport 
models are gradually losing their validity due to large driving electric fields chang- 
ing rapidly within the devices giving rise to hot-carrier and non-local effects. Thus, 
an extension of the models to the computationally more sophisticated transport 
model is necessary. The derivation of the most advanced semi-classical transport 
model, the six-moments model, is sketched next. 



12.2.1 From Drift-Diffusion to Higher Moments Equations 

The drift-diffusion model has been very successful due to its efficiency and numer- 
ical robustness. These properties make feasible two- and three-dimensional numer- 
ical studies of transport in fairly complex device structures. The robustness comes 
from the fact that in this approach the current density is given by a flow with the 
gradient of the quasi-Fermi level as the driving force. One should keep in mind 
several shortcomings of this model when it is applied to miniaturized devices: the 
hot-carrier effects described by the dependence of mobility on the driving field are 
difficult to include correctly and the non-local effects such as velocity overshoot are 
completely neglected. Higher-order transport models like the hydrodynamic trans- 
port [11] and the energy transport [164] models are designed to overcome some of 
these shortcomings of the drift-diffusion model. The energy-transport model addi- 
tionally takes into account the carrier energy balance. There are, however, several 
problems with the energy-transport model to be used in TCAD applications. One 
of them is that it typically tends to overestimate the non-local effects and thus the 
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on-current of a device. It has been demonstrated that, with the heated MaxwelHan 
assumption [58] impHcit in the model, the high energy tail of the carrier distribu- 
tion is considerably overestimated. This may result in unacceptable errors in the 
estimation of the hot carrier induced gate tunneling current [52]. Another exam- 
ple is poor description of transport in partially-depleted silicon-on-insulator (SOI) 
devices. Because of hot carrier diffusion into the floating body of the device over- 
estimated the energy-transport model may even fail completely in predicting the 
device characteristics [61-63], and the application of transport models including 
higher order moments is required. 

Recently the six-moments transport model has been introduced [58, 60, 61, 95]. 
This model includes additional information on the shape of the distribution function 
as compared to the energy transport model, while it is still computationally far more 
efficient than the solution of the Boltzmann equation by Monte Carlo methods. The 
derivation is based on equations for statistical averages defined as 

^=^j<P{^)f{r,kj)d^k, (12.5) 

where ^(k) is a weight function in k-space. For simplicity we assume that the 
energy band is isotropic and parabolic, although generalization to a non-parabolic 
energy band is possible [56]. In order to derive the six-moments model, the follow- 
ing weight functions are chosen: 

00 = 1, 02 = S, 04 = S"^, 

(12.6) 

01 =fik,03 = \(^,(p5 = yS^- 

Taking the moment of the Boltzmann equation gives the following general moment 
equation, 

^^ + Vr-(v®0y)-^,qE-(Vp®0,) = y"d3^0y f^") , (12.7) 

where (8* denotes the tensor product. In order to obtain a closed set of equations 
for moments several approximations have to be introduced. One is concerned with 
the moments of the scattering integral, which are frequently approximated using a 
macroscopic relaxation time expression: 

/d't*,m ,_MH^. ,,,8, 

J V O? /coll ^<f 

The distribution function can be separated into a symmetric part and an antisym- 
metric part. In the diffusion approximation it is assumed that the antisymmetric part 
is small compared to the symmetric part. It implies that the displacement in ^-space 
of the distribution function and correspondingly the drift velocity is small. As a 
consequence, the symmetric part of the distribution function depends only on the 
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absolute value of k, and the distribution function can be written as: 

f{k) = fs{\k\) + fA{k). (12.9) 

With this assumption all tensor-valued averages become scalar quantities. It can 
be argued that at technical frequencies the time derivatives of the fluxes can be 
neglected. As a result one obtains the following balance equations 

M + v(v) = 0, (12.10) 

-^— + V-{\S')-SyqE-{\) = , (12.11) 

at xg 

—- h V ■ (vf? ) -.Sy 2qE- (uff) = , (12.12) 

at Xg2 

and the following flux equations 

^V(^)-^yqE(l) = -m, ^, (12.13) 

-V{S'^)-Sy-(\E{S) = -my^—^, (12.14) 

3 3 xs 

-V(<f3)-5, -qE(fr2) = _,„^Al£i. (12.15) 

3 3 XK 

In order to obtain the six-moments model we introduce the following quantities, in 
analogy to the energy transport models: 

(l) = y, {S) = luvTy, (^2^ = 1^ k^ V r, 0,, (^3^ = l^k^vM6 
z 4 o 

(v) = ^, (v<f) = S,, (v^2^ = K„. 

Here, v is the carrier concentration, T^ denotes the carrier temperature, Jv is the 
electrical current density, and Sv stands for the energy flux density. As compared 
to the energy-transport models, for the six-moments model, the new variables are a 
second order temperature 0^, the moment of sixth order Me, and a flux Ky related 
to the kurtosis of the distribution function. With generation-recombination terms 
added, the balance equations for the new variables become 

y-i. = -Sy(\{^-^ + RvY (12.16) 
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V ■ S, = -C4 ^^^ + E ■ J, - C4 y ^' ~ ^^ + Gg„ (12.17) 

ot Xg 

V-K„ = -C5 h2 5vqE-S;; -C5 V + G©^, (12.18) 

at X0 

C4=^kB, €5 = ^4. (12.19) 

We note that generation-recombination terms may depend on both electron and hole 
distribution functions in an integral, non-local manner [150], which makes the task 
of solving the corresponding equations extremely difficult. Therefore, generation- 
recombination terms have to be modeled carefully using knowledge from the solid- 
state physics of semiconductors and may represent a significant challenge [59]. 

The system of equations for the densities must be completed with the flux 
equations, which are written as 

J, = -Ci (v(vr,)-^„-^Ei;J, Ci=.SvkBAtv, (12.20) 

S. = -C2 (v(vr,6)v)-iv^EvnV €2=^-^—11,, (12.21) 
V ke 7 2 q Tm 



K^ = -C3 ( V(i;M6)-^v^Evr„6)„ V C3=^^— M,, (12.22) 



::v = -C3 h(v 



where the mobility 



4 

kfi ' '" "7 " 4 q T„ 

Mv = qrm/mv (12.23) 



is introduced. 

The drift-diffusion transport model consists of the continuity equation (12.16) 
and the current relation (12.20). The latter is decoupled from the higher order equa- 
tion by introducing a closure assumption for the second order moment, T^ = Tl. 
The physical meaning of this assumption is that the carrier gas is in equilibrium with 
the lattice. 

The energy-transport model additionally takes into account the carrier energy 
balance equation (12.17) and the energy flux equation (12.21). To close the sys- 
tem of equations, an assumption on the fourth order moment has to be introduced. 
The assumption of a heated Maxwellian distribution for the symmetric part of the 
distribution function gives the closure relation 0y = T-^. 

Going one step further in the model hierarchy one obtains a transport model 
of sixth order. A balance equation for the average squared energy (12.18) and 
the related flux equation (12.22) are added. To close the equation system, the 
moment of sixth order Me has to be approximated using the lower order moments. 
For a Maxwellian distribution function and parabolic energy bands one would 
obtain Me = T^ . However, as the six-moments model does not require the heated 
Maxwellian approximation by treating the kurtosis of the distribution function as an 
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unknown, a more general closure relation is desirable. Sonoda et al. [162] proposed 
a similar six-moments transport model, however, with a very restrictive closure rela- 
tion. In [57] an empirical closure relation has been proposed taking into account also 
the second order temperature 0y . 



-Kiy- 



Ms = T^ ( ^1 . (12.24) 

From Monte Carlo simulations, which are an accurate reference, the value of 
c = 2.7 has been estimated [55]. Compared to the energy-transport models, the 
six-moments model requires two additional relaxation times, namely the relaxation 
time of the second order temperature r©, and the kurtosis flux relaxation time xk- 
Since analytical models for these new parameters are not available, tabulated values 
obtained from bulk Monte Carlo simulations [55] can be used. 



12.2.2 Model Verification 

In order to investigate the validity of macroscopic transport models we have car- 
ried out extensive Monte Carlo simulations of transport through a silicon n -doped - 
intrinsic - /i-doped (n — i — n) structure. The length of the two heavily doped 
contacts {No = 10^" cm"-') is kept constant, while the length of the intrinsic chan- 
nel (A^^ = 10^^ cm~^) varies. The band structure of silicon is approximated by 
six valleys with parabolic dispersion relation. Electron scattering with acoustic and 
optical phonons as well as with ionized impurities is taken into account. The steady 
state Boltzmann equation can also be conveniently solved by expanding the angu- 
lar dependence of the distribution function /(r, k) on k using a complete set of 
spherical harmonics ¥[^{0, (p): 

/(r,k) = Y.fim(r,k)Yi^(e,cP), (12.25) 

Im 

where the and <j) are the polar angles between the electric field E and k. In the 
low-field limit one can truncate the expansion (12.25) after the terms with / = 1, 
which in case of parabolic isotropic bands and randomizing elastic scattering results 
in a drift-diffusion transport model with low-field mobility. As is shown in [67], in 
case of elastic scattering this approximation gives good results for silicon where the 
valleys are not isotropic. For general scattering processes and realistic band struc- 
tures as well as at higher driving fields more terms in the expansion (12.25) are 
needed [135]. 

The average velocity determines carrier transport in the structure. The average 
velocity as a moment of the distribution function can be computed along the device 
and is thus a more sensitive measure of validity for a particular transport model than 
any integral characteristic of a device like the total current. A typical result for the 
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Fig. 12.1 Average velocity along the n —i —n structure computed by solving the Boltzmann trans- 
port equation with the Monte Carlo method and using the spherical harmonic expansion method, 
with different numbers of terms included in the series (12.25). It is demonstrated that even in 
a short-channel structure the series expansion with terms only up to / =9 included provides 
excellent results as compared to the more time consuming Monte Carlo data 



average velocity of carriers along an n — i — n structure with an intrinsic region 
length of 40nm computed with the Monte Carlo method is shown in Fig. 12.1. The 
solution of the Boltzmann equation by spherical harmonic expansion, with different 
number of terms in the series (12.25) is also displayed in Fig. 12.1. In such a short 
device the series (12.25) truncated at / = 1 gives only a poor approximation for 
the velocity profile. At the same time it is demonstrated that the spherical harmonic 
expansion (12.25) including terms up to / = 9 gives a perfect result. The inclusion 
of terms with / > 9 into (12.25) does not change the velocity profile confirming the 
rapid convergence of the series (12.25). Therefore, the spherical harmonic expansion 
method gives excellent results with only a few terms included into (12.25) even for 
short-channel devices. For this reason the method is much less time consuming than 
the Monte Carlo solution of the Boltzmann equation. However, Monte Carlo data 
are needed in order to validate the applicability of the spherical expansion method 
to describe transport in short-channel devices. 

Current-voltage characteristics computed with the Monte Carlo (identical to 
those computed with spherical harmonic expansion method) and using the macro- 
scopic transport models based on the moments of the distribution function are shown 
in Figs. 12.2 and 12.3, for several channel lengths. It is demonstrated that for long 
devices (Ijim) the drift-diffusion, the energy transport model, and the six-moments 
model give almost equivalent results which are in perfect agreement with the results 
of the spherical harmonic expansion method. For a device with L^-h = 250 nm the 
drift-diffusion model underestimates the current. Since the carrier temperature is 
constant, the drift-diffusion model does not account for any non-local effects and 
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Fig. 12.2 Current-voltage characteristics for devices with different channel length computed 
with the spherical hannonic expansion method and using macroscopic transport models based 
on moments of the distribution function. Here DD stands for the drift-diffusion model, ET 
for energy-transport model, SM for the six-moments model, and SHE for spherical harmonic 
expansion 





' 1 ' 1 ' 


1 ' 1 ' ^i 












0.04 




DD 

--- ET 










SM 




y 


0.03 


- 


o SHE 


/ 


Y 




y 




a 


• 


Q-' 


<: 


^ 


^^^ 


^ 0.02 


y^ 


^^^^^ - 


-o 


y 












/ .■■■' o 


^^^^ 






40 nm 


0.01 


' .-' ^^-''"'''^ 






/ 0^^^ 




.Q--.r.r.^-^ 




' 1^^ -r.' --Tl^ElI^ 


-.r.a-r--- 


80 nm 


oJ 


C^-^ 1 , 1 


1 1 1 1 1 



0.2 



0.4 0.6 

Vd/Lch [V] 



0.8 



Fig. 12.3 The same as in Fig. 12.2 for shorter devices. The six-moments model gives results 
closest to the results of Monte Carlo and the spherical Harmonic expansion methods 



cannot capture the non-local transport inside short-channel devices. This causes 
the accuracy of the drift-diffusion model to decrease for gate lengths shorter than 
250 nm, where the restriction of constant carrier temperature must be relaxed. Due to 
the temperature gradient, heat flow and thermal diffusion appear. The drift-diffusion 
transport model must be augmented with the energy flow. 
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Fig. 12.4 Average velocity along a device with 40 nm channel length computed with the 
macroscopic transport models and with the spherical harmonic expansion method. While the drift- 
diffusion model underestimates the velocity and cun'ent, the energy-transport model overestimates 
the velocity in short-channel devices. The six-moments model gives hest results as compared to 
the velocity obtained by the spherical harmonic expansion method 



The energy-transport model additionally akes into account the energy flux equa- 
tion and the carrier energy balance equation. The model, however, overestimates the 
drive current as it was mentioned before. Figure 12.4 illustrates the average velocity 
profile in a device with a 40 nm long channel. The drift-diffusion model underes- 
timates the average velocity while the energy-transport model overestimates it. In 
order to reduce this spurious velocity overshoot effect the next moments should be 
included for devices with Lc/, shorter than 100 nm. 

This is accomplished by introducing a transport model of sixth order. A balance 
equation for the average squared energy and the related flux equation are added. To 
close the equation system, the moment of sixth order has to be approximated using 
lower order moments. From Monte Carlo simulations serving as an accurate refer- 
ence, an empirical closure relation has been derived (12.24). The inclusion of higher 
moments improves the transport model significantly. The current- voltage character- 
istics are reproduced fairly well even in devices as short as 40 nm as demonstrated 
in Fig. 12.3, because of the more accurate results for the velocity (Fig. 12.4). The rel- 
ative error of the current computed by the macroscopic transport models presented 
in Fig. 12.5 as function of the channel length clearly demonstrates the applicability 
of the six-moments model to describe the output characteristics of devices with a 
channel as short as 40 nm. 

Since the energy-transport model takes into account the energy flux equation in 
addition to the carrier energy balance equation, it requires modeling of two mobili- 
ties for the current density and the energy flux for each carrier type, one relaxation 
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Fig. 12.5 Relative error of the current computed with macroscopic transport models. While the 
drift-diffusion and the energy-transport models are gradually losing their validity with the channel 
length reduced, the six-moments model maintains its accuracy down to the channel length of 40 nm 



time, and the non-parabolicity factor for non-parabolic bands. The six-moments 
model requires two additional relaxation times for the second order temperature 
and the kurtosis flux. Having too many adjustable parameters is a particular inconve- 
nience of the six-moments model. A solution to this problem is based on tabulating 
some parameters of the model using Monte Carlo simulations. The parameter 
dependences on temperature, doping, and driving field are determined from the con- 
dition that the six-moments transport model reproduces exactly all the six moments 
obtained from the Monte Carlo simulator under homogeneous conditions. Due to 
calibration of all these parameters, a generalization of the six-moments transport 
model to include strain is still pending. 



12.3 Mobility in Strained Silicon 



The transport models discussed in the previous section contain several parameters 
which have to be accurately modeled for the purpose of predictive simulations. The 
drift-diffusion model described by (12.16) and the current relation (12.20) requires 
the carrier mobility (12.23) as such a parameter. The mobility relates the drift veloc- 
ity to the electric field and depends on scattering mechanisms encoded into the 
momentum relaxation time r as well as on the band structure via the effective mass 
m. Knowledge of the mobility dependence on temperature, doping, stress, substrate 
orientation, and driving electric field allows to investigate device performance in 
different regimes for arbitrary parameters. 



12.3 Mobility in Strained Silicon 

12.3.1 Mobility and Piezoresistance 



183 



At low strain value the inodification of mobility can be obtained by using piezore- 
sistance coefficients. The piezoresistance coefficients nijki are defined by 



^Pij 



= ^ijklOkl, 



(12.26) 



where Oki is the stress tensor defined in Sect. 3.1, and Ap^ is the stress-induced 
variation of the resistivity tensor py = p&jj which is diagonal in the crystallographic 
coordinate system in relaxed silicon. Like in the case of the elastic stiffness ten- 
sor (3.18), the number of independent entries for to the cubic semiconductors is 
reduced due to symmetry, and the piezoresistance tensor is characterized by only 
three independent components conveniently written as tt\\, 7Ti2, and 1144. In this 
case the relation (12.26) between stress and resistivity is written as: 
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Due to a simple relation Apijj/ii, = —Apij/p the stress-induced variation Afiy of the 
mobility tensor /Xy = pLSjj is expressed through the piezoresistance coefficients as: 
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(12.28) 



Silicon and germanium were the first materials used as piezoresistors. Smith [161] 
reported first measurements of large piezoresistive coefficients in bulk silicon. 
For a lightly doped p-silicon with resistivity of 7.8 Q-cm the piezoresistance 
tensor is 
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while for n-silicon the piezoresistance coefficients are: 
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We notice that the coefficient :7r44 determines the mobility modification due to shear 
strain. 

Although convenient, piezoresistance coefficients are found to depend on doping 
and temperature. The piezoresistance coefficients in the surface layers are different 
from bulk values [19] and depend on such values as an effective field in surface 
layers. A more refined theory is thus required in order to model mobility. 



12.3.2 Compact Mobility Modeling 



In some cases, the physically-based consideration helps to generalize the model and 
to include additional effects. In relaxed silicon, the lowest conduction band consists 
of six valleys, aligned along the crystallographic axes. In any valley, the mobility is 
anisotropic due to anisotropy of the inverse effective mass tensor: mobility is higher 
in the direction corresponding to a smaller effective mass. However, net electron 
mobility being the average of the components along the three valley orientations 
is independent of direction, or isotropic. To develop a model for stress dependent 
mobility, the peculiarity of the band structure modifications has to be taken into 
account. In silicon the conduction band consists of three pairs of equivalent val- 
leys with their minima located close to the X-points in the Brillouin zone. Uniaxial 
compression in [001] direction decreases the band energy of the valley parallel to 
the strain and transfers electrons into these valleys. Since the [001] valleys have high 
mobility in the (001) plane average mobility in this plane is increased. Alternatively, 
mobility in the direction of compression is lowered. Elongation in [001] direction 
has the opposite effect. 

Let us consider silicon biaxially stressed in the [001] plane. This type of stress 
is practically realized in Si grown on [001] SiGe substrate [195]. In this case the 
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degeneracy between the three pairs of valleys is lifted, and the relative shift AE^ 
between the in-plane and out-of-plane valleys appear. The shift is given as [10] 

AE^^^ = 3die,„ + £yy + Sz.J + SuEii, i=x,y,z, (12.31) 

where S^/ and £"„ denote the dilation and shear deformation potentials for the 
conduction band and the e„ denote the diagonal components of the strain tensor 
expressed in the principal coordinate system. 

The electron occupation in the valleys depends on their relative energy shifts 
and must be properly accounted for [30, 116]. As suggested in [116], the electron 
mobility can be computed by taking the weighted average of the electron mobil- 
ity tensors Jinuns of the i''^ pair of valleys in Si with the corresponding electron 
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Here n^J is calculated using Boltzmann statistics with A^^ as the effective density 
of states and AE^ as the energy shift, for the /''' valley, ks and T denote the 
Boltzmann's constant and ambient temperature, respectively. In (12.32) the mobility 
tensor is a product of a scalar mobility fiL and the scaled inverse mass tensor: 
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The tensors in (12.35) are the inverse effective mass tensors with mt, mi denoting 
the transversal and lateral masses for the ellipsoidal X-valleys in Si. The tensor is 
scaled to a dimensionless form by the conductivity mass, mc'. 
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Electrons in bulk silicon are scattered by phonons and impurities. Due to the later 
scattering mechanism the electron mobility depends strongly on the doping level. 
After scattering with phonons an electron may stay within the same valley or scatter 
into an equivalent valley along the same axis (^-type). The after-scattering state can 
be also located in a non-equivalent valley (/-type scattering) [76]. By applying the 
Matthiesson's rule one can model isotropic electron mobility in relaxed Si, includ- 
ing the effects of g-type, /-type phonon scattering, and dependence on the doping 

concentration Nj via the corresponding relaxation times Tequiv, tneqC^-E^ )' ^^^ 
ri{Ni)as 

li{Ni,AE';p) = -. ^. (12.37) 

/ 1 1 1 
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The electron mobility for the / valley in strained Si can be written as [25] 

ri^'}^(Ni,AE^^) = ^ ^-^ ■ «5y}. (12.38) 
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where m..J denotes the scaled effective mass tensor for the /''' valley in (12.35), and 

r 

h is a known function of T, AE^ [25]. The parameter fi = , where / is 

nic 
the mobility enhancement factor, is defined as the ratio of the saturation value of the 

transversal mobility in the valley / at high values of strain to the unstrained mobility 

fi^ . Equation (12.38) is plugged into (12.32) to give the total mobility tensor for 

electrons in strained Si as a function of doping concentration Nj and strain. 

The model was generalized to describe the low-field mobility in silicon under 

arbitrary stress conditions to include shear strain [26]. For tensile stress along [1 10] 

the energy dispersion of the lowest conduction band is influenced as follows (see 

Sect. 9): 

• The valleys located along the [100] and [010] directions move up in energy with 
respect to the valleys located along the [001] direction, in accordance with (8.2); 

• The band minima of the valleys oriented along the [001] direction move down 
in energy, in accordance with (9.2) and (9.8), and towards the Z -point located at 
^[0, 0, ±1], in accordance with (9.7); 

• The shape of the valleys along the [001] direction is warped which results in a 
modification of the transversal effective masses (9.9)-(9.10) and the longitudinal 
mass (9.11). 

In the case of the stress-dependent masses (9.9)-(9.10) the inverse effective mass 
tensor ot~^ for the [001] valleys in the crystallographic coordinate system is written 
as [26]: 
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and m^^ for the [001] valleys is defined as 

mc (ri) = ^ . 

In order to validate compact mobility models and calibrate their parameters, 
results must be used which are obtained by more precise methods, which, however, 
require considerably more computational resources. A feasible approach to obtain 
an accurate solution of the Boltzmann transport equation is based on the Monte 
Carlo techniques briefly discussed next. 



12.3.3 Monte Carlo Methods for Transport Calculations 

Methods based on Monte Carlo techniques are well established for studying trans- 
port in semiconductors [73]. The motion of charge carriers is simulated in the 
appropriate phase space formed by position and momentum. In the presence of 
external fields, the carriers which are considered as point-like objects with well 
defined momenta and positions move according to Newton's law on classical trajec- 
tories. A dispersion relation expressing the carrier energy dependence on the crystal 
momentum is determined by the semiconductor's energy band structure. The free 
flight of carriers along the trajectory is interrupted by scattering events which are 
assumed local in space and instantaneous in time. Scattering is modeled as a random 
process. The duration of a free flight, the type of scattering mechanism, and the state 
after scattering are selected randomly from given probability distributions charac- 
teristic to the microscopic scattering process. The method of generating sequences 
of free flights and scattering events appears to be so intuitively transparent, that it is 
frequently interpreted as a direct emulation of the physical transport process rather 
than a numerical method. The first Monte Carlo algorithms used in device simula- 
tions were originally derived from merely physical considerations, viewing a Monte 
Carlo simulation as a computer experiment. These algorithms are called Ensemble 
Monte Carlo (EMC) [68, 120] and One Particle Monte Carlo (OPMC) [76, 138]. 
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Alternatively, one can reformulate the transport equation as an integral equation 
first and then develop a Monte Carlo algorithm for its solution [75, 128]. When the 
Boltzmann equation is transformed to an integral equation which is then iteratively 
substituted into itself [75], the iteration series results in the technique called Monte 
Carlo Backward. The algorithm is useful, if rare events have to be simulated or 
the distribution function is needed only in a small phase space domain [94] . If the 
Boltzmann equation is reformulated in an adjoint integral form [90], a link between 
a physically-based Monte Carlo method and an iterative procedure of the solution 
of an integral equation is established. 

The work of Kurosawa in 1966 [101] is considered to be the first account of an 
application of the Monte Carlo method to high-field transport in semiconductors. 
The following decade has seen considerable improvement of the method and appli- 
cation to a variety of materials [73]. Early papers deal with gallium arsenide [35] 
and germanium [36]. In the mid 1970s a physical model of silicon has been devel- 
oped, capable of explaining major macroscopic transport characteristics [17,74]. 
The used energy band structure models were represented by simple analytic expres- 
sions accounting for non-parabolicity and anisotropy. With the increase of the 
carriers' energy the need for accurate, numerical energy band structure models arose 
[42, 100, 152, 197]. For electrons in silicon, the most thoroughly investigated case, 
it is believed that a satisfactory understanding of the basic scattering mechanisms 
gives rise to a new "standard model" [38]. With the introduction of strain to enhance 
the performance of MOSFETs, however, the need for accurate full-band transport 
analysis has regained considerable interest [41,43,79]. 

We demonstrate the importance of full-band consideration by using an analysis of 
the electron low-field mobility in strained Si as an example. We apply the simulator 
VMC [187], which offers simulation algorithms for both bulk semiconductors and 
devices with models based either on analytical band structure or on the full-band 
structure analyses. VMC includes a comprehensive set of scattering models with 
phonons, ionized impurities, alloy scattering, as well as impact ionization both for 
electrons and holes. 

Figure 12.6 demonstrates mobility enhancement of holes due to uniaxial stress 
applied along [111] direction to lightly doped p-silicon. Band structure was obtained 
with the empirical pseudopotential method described in Chaps. 5 and 8. Compres- 
sive stress favors significant mobility enhancement while tensile stress reduces hole 
mobility. The model based on the piezoresistance coefficients is also shown by the 
straight line. The piezoresistance coefficient value :7r<iii> = 71.8 x 10~" Pa~^ 
along the [111] stress direction [19] calculated from the values from [161] is in 
good agreement with the results of the full-band calculations for stress absolute 
value below 0. 1 GPa. For larger stress values the mobility enhancement factor devi- 
ates significantly from that predicted by the piezoresistance mode, especially for 
tensile stress above 0.5 GPa where the mobility modification saturates. 

For electrons in n-silicon, a phonon scattering model based on Jacoboni and 
Reggiani [76] is used. The model takes into account long-wavelength acoustic 
phonons causing intravalley transitions and three /- and g-type phonons for inter- 
valley transitions. Intravalley scattering from acoustic phonons is treated as elastic 
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Fig. 12.6 Hole mobility enhancement factor along the [111] stress direction obtained with the 
full-band Monte Carlo simulations (symbols). Prediction based on the piezoresistance model with 
parameters for lightly doped p-type silicon [19, 161] is shown by a straight line 

Table 12.1 Modes, coupling constants, phonon energies, and selection rule of inelastic phonon 
scattering 



Mode 


A [MeV/cm] 


hai [meV] 


Selection rule 


Transversal Acoustic 


47.2 


12.1 


f 


Longitudinal Acoustic 


75.5 


18.5 


f 


Longitudinal Optical 


1042.0 


62.0 


f 


Transversal Acoustic 


34.8 


19.0 


g 


Longitudinal Acoustic 


232.0 


47.4 


g 


Transversal Optical 


232.0 


58.6 


g 



process. Following [25] we slightly adjust the original values for the coupling con- 
stants for intervalley phonon scattering [76] to achieve a bulk mobility enhancement 
factor of 70% in biaxially strained Si layers. The coupling constants for acoustic and 
optical intervalley phonons, as well as the phonon energies are listed in Table 12.1. 
In full-band simulations the scattering rates are proportional to the density of states 
calculated from the band structure [79]. 

Figure 12.7 demonstrate the orientation-dependent electron mobility enhance- 
ment factor for tensile stress applied in [111] direction obtained from the full-band 
Monte Carlo simulations. The current direction is defined by the unit vector n 



n = [cos(6) + 0o)/V2, cos(6) + 0o)/V2, sin(6) + cPo)], 
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Fig. 12.7 Orientation dependent electron mobility variation in silicon under tensile stress along 
[111] direction, for several stress values. Results of full-band Monte Carlo are well explained by 
dependence of the effective masses on shear strain. Note a mobility degradation at high stress 
values caused by an increase of scattering due to stress-dependent increase of the density-of-states 
mass 



where 0o ~ 36° is the angle between [111] and [110] directions. The angle 
determines the current direction within the plane defined by the [111] and [1 10] is 
counted from the [111] direction. 

Results of the full-band simulations coincide well with the analytical model pre- 
sented in the previous subsection, even for high stress value. Stress along [111] 
direction affects all six valleys in a similar way and thus does not produce rela- 
tive energy shift between the valleys. [Ill] stress produces shear strain component, 
which in this case affects all six valleys in a similar way. In this case mobility of 
an ellipsoidal valley in an arbitrary orientation n with respect to the crystallographic 
coordinate system in a certain direction n is determined by the (inverse) conductivity 
mass l/rrin in this direction 

where m~^{r]) is the inverse effective mass tensor for a valley in the crystallo- 
graphic system. Taking into account the dependence of the density-of-state effective 
mass mdosij]) = {miW)>nt\(y])>nt2{^))^^^ on dimensionless strain r], the mobility 
dependence on [1 1 1] strain in a direction n is obtained as 



jiQ m„ 



mdos 
rridosin) 



ill 



(12.42) 



12.3 Mobility in Strained Silicon 
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Fig. 12.8 Simulated bulk mobility of intrinsic Si as a function of strain for stress direction [100] 
(a) and [1 10] (b). Mobility is plotted along the stress direction and along two orthogonal directions 
from full-band Monte Carlo (solid lines) and analytical band Monte Carlo (dotted lines). Symbols 
indicate the change of mobility calculated using the piezoresistance coefficients [161] 



where jiq is the mobility of relaxed silicon, nic = 2(1/ mi + 2/mt) '. The strain 
dependence of the inverse effective mass tensor m~^(f;) on shear strain is deter- 
mined by (9.9, 9.10, 9.11). For the [001] valleys nr^rj) is determined by (12.39, 
12.40, 12.41). 

Comparing simulation results from analytical band structure Monte Carlo with 
full-band Monte Carlo allows determining the limits of validity of the analytical 
band model. In Fig. 12.8 simulation results for the electron mobility of strained Si 
obtained with full-band and analytical band structures for the stress directions [100] 
and [110] as well as predictions from a model based on the linear piezoresistance 
coefficients [161] are shown. Mobility is plotted in three directions, one parallel and 
two perpendicular to the stress direction. Good agreement is found for both stress 
directions at small stress (<100 MPa). 

In Fig. 12.8a the simulation results from analytical and full-band Monte Carlo 
for stress along [100] are compared and good agreement is obtained. The resulting 
mobility is anisotropic in the (001) plane (/i[ioo] 7^ M[oio])> and can be explained 
by strain induced X- valley shifts. Mobility saturates at approximately 1% strain, 
regardless of the sign of strain. The saturated mobility values are larger for compres- 
sive strain, since in this case four X-valleys with unfavorable conductivity masses 
are depopulated. 

In Fig. 12.8b simulation results are shown for stress along [110]. At compres- 
sive stress, four valleys move down in energy, yielding a decreased mobility in the 
(001) plane and mobility enhancement along [001]. For tensile stress along [110], 
the mobility behavior is remarkably different from the previous case. First, mobili- 
ties along the directions [110], [110], and [001] are different from each other, with 
the largest mobility enhancement observed in [1 10] direction. Furthermore, no clear 
in-plane mobility saturation is observed as the value of stress increases. The mobility 
enhancement in tensile stress direction is determined by the effective mass change 
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induced by the shear strain component in the primarily populated valleys along 
[001] [168, 183]. Thus, analytical models solely based on strain induced interval- 
ley electron transfer [67] would fail to predict correctly the mobility enhancement 
along [110] and to explain the origin of the non- vanishing shear piezoresistance 
coefficient :7r44 = —13.6 • 10~"Pa~', hence these models are not capable of repro- 
ducing the anisotropy of electron mobility in uniaxially stressed channels with [110] 
channel direction. 

It can be seen that the results from analytical band structure Monte Carlo agree 
well with those from full-band results up to 0.5% shear strain. At larger strain levels 
the band deformation is so pronounced that the energy band description in terms of 
an effective mass is no longer accurate, and full-band Monte Carlo simulations must 
be used to calculate the low-field mobility. 



12.4 IVIixed Quantum-Semi-Classical Description 
and Quantum Corrections in Current 
Transport IVIodels 

The characteristics of modern semiconductor devices are strongly influenced by 
quantum-mechanical effects. Due to this fact, purely classical device simulation may 
not suffice to accurately reproduce transport. Size quantization of carrier motion 
in the confining potential of an inversion layer is the most investigated and well 
understood quantum-mechanical effect in modern MOSFETs. Because of the size 
quantization, the energy spectrum becomes discrete in the confinement direction, 
while it is still continuous in the transport direction. Thus the three-dimensional 
energy band structure is partitioned into a set of two-dimensional quantum sub- 
bands. In order to find the subband energy levels and the corresponding wave 
function, the Schrodinger equation has to be solved. Within each subband trans- 
port can be described by a set of Boltzmann equations for the subband distribution 
functions. Unlike the bulk case, the Boltzmann equations for different subbands are 
coupled because of intersubband scattering. This complicates the computation of the 
mobility. Even more, in inversion layers and especially in FETs with ultra-thin body 
(UTB) the subband occupation may be large at high effective fields, and degeneracy 
effects due to Pauli exclusion principle start playing an important role. 

Indeed, the Pauli exclusion principle forbids the double occupancy of the same 
quantum state. Therefore, scattering to an already occupied state is prohibited. In 
order to describe transport correctly, degeneracy effects must be carefully taken 
into account. Below an efficient Monte Carlo algorithm based on the linearized 
Boltzmann equation including usually ignored effects of the fermion statistics and 
degeneracy is presented. Degeneracy effects lead to enhanced intersubband scat- 
tering in (001) double-gate (DG) UTB FETs which results in a carrier mobility 
qualitatively different from the case when degeneracy is ignored. 

The potential entering into the Schrodinger equation depends on the charge 
carrier concentration in the inversion layer. By knowing the wave functions and 
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occupations of the subbands, the charge carrier concentration in the inversion 
layer can be obtained. Therefore, the potential has to be found self-consistently 
by solving the Schrodinger and the Poisson equations simultaneously. This pro- 
cedure is time consuming and should be avoided whenever timely results must 
be obtained. One option is to exploit the well established semi-classical trans- 
port models, while correcting them in such a way that they mimic the behavior 
of the complete quantum-mechanical system. Various quantum correction models 
are available [29, 65, 80, 1 30, 131]. Some of these corrections are based on empirical 
fits with numerous parameters [80, 130]. In other models the convergence [29] is 
strongly affected by the dependence on the electrical field. Some models [65] have 
to be re-calibrated for each particular device. A comprehensive comparison of these 
models was recently performed in [189]. 

The quantum correction which mimics the local density of states close to the 
interface can also be understood as an additional quantum potential [37] which has 
to be added to the classical self-consistent potential in order to describe the decrease 
of carrier concentration at the interface correctly. Because this additional quan- 
tum potential enters into the current relations, it opens an opportunity to introduce 
quantum-mechanical effects into the drift-diffusion and higher moments transport 
models. In fact, the appearance of the quantum potential can be easily illustrated, if 
one just substitutes the wave function represented as the product of its real amplitude 
and the exponent containing the phase factor 

iA(r, t) = A{r, t) exp{i(P{r, t). (12.43) 

into the Schrodinger equation. The density n{r,t) = ^^(r, ?), the velocity v = 
hV(p{r, t)/m, and the current j = A^v are introduced and one obtains the following 
system of equations [185]: 

dn(r,t) 

\ + V [«(r, Ov] = 0, (12.44) 

at 

^ = ^ + Virj)+VQc{r,t). (12.45) 

Here, the additional quantum correction potential Fgc(r, t)h given by the following 
expression: 

2m V« 

A similar expression for the effective quantum correction potential can be ob- 
tained from the quantum-mechanical theory based on the evolution of the moments 
for the Wigner function [69]: 

VQc(r,t) = - — Alnn, (12.47) 

3r 

where r is the parameter, taking values between r = 1 in the extreme case with only 
a single subband filled and r = 3 in case of many subbands filled. 



Fgc(r,0 = - — ^^. (12.46) 
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The concept of quantum correction potentials opens the way for using the 
particle-based Monte Carlo algorithm to investigate transport in systems with strong 
size quantization. Therefore, the full power of three-dimensional full-band Monte 
Carlo methods which include accurate band-structure and scattering processes 
to obtain transport characteristics in inversion layers and SOI structures can be 
applied [83, 134]. Recently, mobility of stressed Si/SiGe inversion layers was 
investigated [34,99, 184]. Interestingly, due to a concentration decrease at the inter- 
face, the surface roughness scattering is underestimated, which requires special 
corrections [15]. Due to statistical noise in the second derivative of the carrier con- 
centration, smooth approximations for the quantum correction potential are used in 
practice. The popular form of the effective potential introduced by Ferry [37] relates 
it to the self-consistent potential via an integral smoothing relation with a Gaussian 
kernel. Using this quantum correction potential, transport characteristics in ultra- 
thin body structures with short channels were recently investigated and compared 
against analytical predictions [133]. 

The form of the quantum potential (12.47) is commonly referred to as the density 
gradient correction and is extensively used in quantum hydrodynamic calcula- 
tions [3-6]. The expression for the current density including the quantum correction 
reads [8]: 

h'^ AJn 
J = ennE + eDyn + enii — y^^. (12.48) 

2r V" 

Substitution of the current relation (12.48) into the continuity equation (12.2) 
results in the differential equation for the particle concentration n of the fourth 
order. Such an equation needs two boundary conditions. If one considers the inter- 
face between the semiconductor and the dielectric, the first boundary condition is 
the standard one put on the normal derivative of the concentration requiring the 
normal current component to be zero at the interface. The second boundary condi- 
tion allows to set the carrier concentration to zero at the interface. Thus, the quantum 
drift-diffusion theory based on (12.48) supplemented with the corresponding bound- 
ary conditions automatically reproduces the concentration decrease at the interface, 
mimicking the quantum-mechanical behavior. 

For numerical transport calculations it is convenient to avoid the discretization of 
the fourth order equation and to include the quantum potential correction term into 
a generalized electro-chemical potential [8]. After a careful calibration of the result- 
ing density gradient model the transport calculations including the source-drain 
tunneling in ultra-scaled MOSFETs becomes possible [8]. Quantum hydrodynamic 
models are reviewed in [78]. 

We are not aware of any generalization of the quantum correction method to 
include strain. The possible reason is that, although the calibration of quantum cor- 
rection to include strain seems possible, it is not straightforward to convert the 
modification of the band structure in stressed silicon to a particular form of the 
quantum correction potential. Indeed, according to (12.43), the wave function as 
a solution of the Schrodinger equation is required to guess the form of the quan- 
tum correction potential. Since the accurate evaluation of the quantum-mechanical 
carrier density is based on the calculation of the eigenstates of a Hamiltonian, it is 
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beneficial to solve the Schrodinger and Poisson equations self-consistently on slices 
in the quantization direction perpendicular to the transport direction. In this case 
carriers belong to quantum subbands. The motion along the channel in this quantum 
subbands is characterized by the subband dispersion relations E„(k}. This disper- 
sion relation can be straightforwardly integrated into a Monte Carlo algorithm to 
solve a transport equation within the subband. Thus, in the MOSFET devices with a 
channel length larger than the mean free path, a mixed quantum-mechanical descrip- 
tion in the quantization direction and a classical description along the transport 
direction can be used. Strain-induced band structure modifications and scattering 
rates can be easily introduced in the method allowing an accurate analyses of 
transport properties in stressed MOSFETs. 



12.4.1 Subband Monte Carlo and Degeneracy Effects 

Monte Carlo is a well-established numerical method to solve the Boltzmann trans- 
port equation. Within this approach, particles are moving on classical trajectories 
determined by Newton's law. The motion along the trajectory is interrupted by 
scattering processes with phonons and impurities. Scattering is modeled as a ran- 
dom process. The duration of a free flight, the scattering mechanism and the state 
after scattering are selected randomly from a given probability distribution which is 
determined by the scattering process. 

The Pauli exclusion principle prohibits scattering into an occupied state. There- 
fore, scattering rates depend on the occupation probability of the after- scattering 
state, given by the distribution function, which is the solution of the Boltzmann 
equation. Dependence of scattering rates on the solution makes the Boltzmann trans- 
port equation nonlinear. In many cases the occupation numbers are small and can 
be safely neglected in transport simulations. However, in the silicon-on-insulator 
based MOSFETs utilizing thin silicon films the occupation probabilities become 
high and can reach values close to unity at high carrier concentrations. In this case, 
a self-consistent solution of the Boltzmann transport equation with respect to the 
occupation probabilities given by the subband distribution functions is needed. 

Different approaches are known to include degeneracy effects into Monte Carlo 
algorithms. When the distribution function is close to the equilibrium solution, 
the blocking factor can be approximated with the Fermi-Dirac distribution func- 
tion [42]. Another method method is to compute the occupation numbers self- 
consistently [12, 111]. This approach is applicable not only to low-field mobility 
simulations but also for higher driving fields [110]. Recently, the method was gen- 
eralized to include the subband structure obtained from a self-consistent solution 
of a k-p-based Schrodinger and Poisson equations. The method describes hole 
transport in a silicon device subject to arbitrary stress conditions. Because the sub- 
band distribution functions have to be obtained self-consistently, the method is 
computationally very demanding. 



196 12 Demands of Transport Modeling in Advanced MOSFETs 

In practice, engineers are interested in obtaining the device current-voltage 
characteristics timely, therefore, a simplified approach based on a semi-classical 
transport model is frequently applied. In this case, one needs to model the trans- 
port model parameters including their dependence on stress. Due to the presence 
of an interface, the surface layer mobility is affected by extra scattering mech- 
anisms. Surface roughness scattering, phonon scattering, and Coulomb scattering 
at the interfacial states are the most important scattering mechanisms limiting sur- 
face mobility at high, intermediate, and low carrier concentration, respectively. Each 
scattering mechanism depends strongly on the value of the effective field confining 
the carriers at the interface. The effective field depends strongly on the applied gate 
voltage. Stress alters the relative occupation of the valleys, subbands, and their dis- 
persion at the gate voltage fixed, however, the degree to which these changes appear 
depend on the value of the effective field. Therefore, stress-induced mobility modifi- 
cation is a function of the gate voltage. It then follows that even at low stress values, 
when the mobility modification can be characterized by piezoresistance coefficients, 
it is not very convenient to use these coefficients for a particular device because of 
their dependence not only on doping level but on the effective field as well. Thus, 
a reliable method to accurately model mobility in stressed surface layers and SOI 
structures is required. 

A Monte Carlo algorithm for arbitrary stress conditions which includes the 
degeneracy effects for mobility calculations is developed [ 1 60] . The method can be 
generalized to include subband structure [167] and magnetic field [81]. The method 
is based on the linearized version of the Boltzmann transport equation and thus 
incorporates degeneracy effects exactly in the limit of vanishing driving fields and 
is valid for arbitrary scattering mechanisms and for general band structure. 

In order to obtain the low-field mobility, one compute the response to a small 
electric field E(?). Generated by this field a small perturbation 5/„ (k, t) of the equi- 
librium Fermi-Dirac distribution /o(£'„(k)) in each quantum subband satisfies the 
system of coupled linearized subband equations: 

^M^^ = -eE(OVk/o(£„(k)) + Qn[Sf], (12.49) 

where Qn is the scattering operator of the linearized Boltzmann equation 

——(AUK k')5/™(k', t) - yl,„„(k', k)5/„(k, 0). (12.50) 
„, (2:r)2 

The scattering rates 7l,„„(k, k') in (12.50) are related to the rates S„,„{]i, k') of the 
original Boltzmann equation via 

yl„,„(k', k) = (1 - /o(£™(k')))S,„„(k', k) + /o(£^(k'))S„,„(k, k'), (12.51) 

where E„ (k) is the total energy in the n-th subband. The equation for the perturba- 
tion has a form similar to the Boltzmann equation, with two important differences: 
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(i) the presence of a source term which depends on the small electric field and is 
proportional to the derivative of the equilibrium function, and (ii) renormalized scat- 
tering rates which enforce the equilibrium solution of the homogeneous equation 
(12.49) to be /o(£„(k))(l - ME„(k))), and not /o(£„(k)). 

In order to calculate the mobility, a subband Monte Carlo method is used to solve 
the system (12.49). Following the procedure outlined in [160], we assume the time 
dependence of the driving field to be a set of instantaneous delta-like pulses: 

E(f) = Eor^S(f-f,). (12.52) 



In (12.52) r is the average period between the delta-pulses and Eq is the value of 
the field averaged over a long simulation time T: 

T 



We compute the current response I, (?) produced by an electric field pulse at the 
moment r, as 

I,- = eH(t - ti) V f -^y„8f„it - ti), (12.53) 

where v„ is the velocity in the «-th subband, and H{t) is the Heaviside function. 
The instantaneous current density J(?) = ^^ I, is calculated as the sum over current 
densities I,- produced by all pulses / . The current density value averaged over some 



long time T is then expressed as 



^f^f 



T 
dtli(t). 




The low field mobility is defined as fiafi = Ja/(enEp), where the direction of the 
/J-axis coincides with the direction of Eq, and n is the carrier concentration. Now 
the mobility can be easily computed using a single-particle Monte Carlo technique. 
The method can be illustrated as follows. The diffusion tensor D^p is calculated 
as an integral of the velocity auto-correlation function [145] 

oo 

Dap = j dx [va(t)vp(t + r)) , (12.54) 



where angular brackets denote the time averaging over the stochastic dynamics 
determined by the rates yl,„„(k, k') of the linearized multi-subband Boltzmann 
scattering integral in case of degenerate statistics. 
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The mobility tensor /Xq,^ is related to the diffusion tensor via the Einstein relation 
for degenerate statistics 

1 dn 
flap = e Dap-— r^, (12.55) 

n dEp 

where Ep is the Fermi level. 

In order to compute the mobility, we accumulate three temporary estimators t, 
wp, and Vafi during the Monte Carlo simulations: 

1. Initialize t = 0, wp = 0, v^fi = 0, and start the particle trajectory with the 
stochastic dynamics determined by the scattering rates ylm„(k, k') from (12.51) of 
the linearized multi-subband Boltzmann equations; 

2. Before each scattering event update Vq,^ , wp , and t: 

t = t+ '^'^ 



yvp = wp +vp{j)r{i), 

VaP = Van + '^U)Va(j)wii(j); 

3. When t is sufficiently large, compute the mobility tensor as 



knT t 



where VaiJ) denotes the a -component of the velocity, E{j) is the particle energy, 
f{E) is the Fermi-Dirac function, and t(J) is the time of j'-th free flight. The 
convergence of the method is improved by resetting wp = each time a velocity 
randomizing scattering event occurs. 

The multi-subband generalization of the method requires the subband ener- 
gies and wave functions to be pre-calculated self-consistently, for each value of 
the effective field. The transport calculations account for electron-phonon interac- 
tion [76, 139] and surface roughness scattering, which are the dominant mechanisms 
determining the mobility in the region of high effective fields. This captures the 
main difference of the mobility behavior in Double-Gate (DG) and Single-Gate (SG) 
structures at high concentrations for different orientations, as we demonstrate below. 

In order to validate the computation procedure, we compare the results of 
mobility simulations with data available in the literature [31], see Fig. 12.9. Our sim- 
ulations are in good agreement with previous calculations for a FET with 20 nm and 
2.5 nm thick (100) thick silicon body, assuming similar surface roughness scattering 
parameters. 

With downscaling of semiconductor devices continuing, the introduction of 
double-gate silicon-on-insulator field-effect transistors with ultra-thin silicon body 
seems increasingly likely [106]. The second gate allows to achieve excellent elec- 
trostatic channel control [106] which makes these devices perfect candidates for the 
far-end of ITRS scaling [1]. 
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Fig. 12.9 Mobility simulations for SG (100) SOI FETs (filled symbols). Results are in good 
agreement with those reported previously [3 1 ] (open symbols) 
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Fig. 12.10 Mobility simulations for a 3 nm thick SG structure, using Boltzmann and Fermi-Dirac 
statistics. Mobility calculated without Pauli exclusion principle taken into account is overestimated 
at high earner concentrations. It is due to different subband population for the same #,., with and 
without degeneracy effects included. Inset: Boltzmann and Fermi-Dirac distributions for the same 
temperature and A^j 



Mobility in a single-gate (100) FET is compared for degenerate and non- 
degenerate statistics of carriers. Figure 12.10 shows that neglecting the Fermi block- 
ing factor leads to a significant overestimation of mobility, especially at higher 
carrier concentrations. The difference between the mobility values for degenerate 
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and non-degenerate statistics looks surprising. Indeed, at high carrier concentra- 
tions the principal scattering mechanism limiting the low-field mobility is elastic 
surface roughness scattering. For elastic scattering the forward and inverse scatter- 
ing rates are equal, and the Pauli blocking factor cancels out from the equations for 
the elastic scattering rates. However, the Pauli blocking factor is also present in the 
inelastic electron-phonon part of the total scattering integral and ensures the equi- 
librium solution to be the Fermi-Dirac distribution function. In the degenerate case 
the higher value of the Fermi level may open additional channels for inter-subband 
scattering, which reduces the mobility. 



12.4.2 Simulation Results for Mobilities in Single- 
and Double-Gate FETs 

After the surface roughness parameters were calibrated to reproduce the bulk (100) 
surface mobility, the same model parameters are used to simulate mobilities of 
single- and double-gate (100) structures. Results are compared in Fig. 12. 1 1 for sev- 
eral Si body thicknesses. For a 20 nm thick Si body the two inversion layers formed 
at the opposite interfaces of the DG structure are well separated and independent. 
The double-gate FET mobility plotted as a function of the concentration per single 
channel is in good agreement with the results for a SG structure. Both curves are 
in reasonable agreement with experimental data for a 32 nm thick Si body [180] 
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Fig. 12.11 Mobility for (100) UTB FETs. Mobility is plotted as a function of concentration for 
the SG structure and as a function of half of the concentration for the DG structure. Mobility in the 
DG structure is smaller than that in the SG structure at high concentration, which is in qualitative 
agreement with the experimental data [181] and in contrast to the bulk inversion concept [9] 
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and reproduce the universal mobility curve [175]. For 3 nm body thickness the DG 
mobility, plotted as a function of Ns/2, tends to be slightly higher at small effective 
fields. However, for high A^^ the DG mobility for (100) orientation becomes lower 
than the SG mobility. This is in qualitative agreement with experimental data, also 
shown in Fig. 12.11. This behavior contradicts the volume inversion concept [9]. 
According to this concept in volume inversion carriers are located in the center of 
the Si body and are further away from the interfaces. Due to reduced surface rough- 
ness scattering the mobility in a DG structure is expected to be higher than the 
mobility in a SG structure. 

It should be noted that in order to obtain a better quantitative agreement with the 
experimental mobility, additional mechanisms such as surface optical phonons inter- 
action, or scattering on fluctuations of STsoi [32] could be included. Introduction 
of acoustic phonon confinement relevant to FET structures with thin silicon bod- 
ies also improves agreement with the experimental data [27]. However, our model 
captures the main difference in the mobility behavior in DG and SG structures at 
high concentrations, namely that the DG mobility is lower than the SG mobility. It 
should also be noted that in DG structures the mobility plotted as a function of the 
total concentration is always higher than in SG structures, as was pointed out in [33] . 

Using a thick body DG FET, we demonstrate the importance of inter-subband 
scattering for mobility calculations. For a thick DG structure the wave functions 
^i (z) corresponding to two independent inversion layers / = 1,2 are located at the 
opposite interfaces. Due to the large separation between the interfaces, the value of 
the wave function and its derivatives is zero at the interface opposite to its location. 
By making symmetric V^+(z) and asymmetric "f-iz) combinations of V^i(z) and 
i^iiz), one obtains the wave functions corresponding to the first and the second 
subband in the Si body: 

i^±(z) = ^(iAi(z) ± fiiz)). (12.56) 

7(2) 

For the sake of simplicity let us consider surface roughness scattering only at the 
left interface, z = 0. The surface roughness scattering rate /l/(^) between the two 
subbands k and j is proportional to [7, 137] 

where the derivative dij/k(0)/dz of the /:-th wave function is computed at the 
interface z = 0, and m- is the quantization mass. For a thick body FET it holds 
fi?^2(0)/(iz = 0, and the total scattering rate within the first subband if+iz) is: 



2 \2mJ [ dz ) ' 
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where both 7~++ and F_+ are: 



1 / fi^ 



-)\ 



dfm 
dz 



(12.59) 



Therefore, a half of the total scattering rate would be missed if the intersubband 
scattering were neglected. A similar conclusion also holds for electron-phonon 
scattering [157]. 

Now we turn our attention to (1 10) UTB FETs. Due to the surface orientation, the 
effective equation for the envelope functions in quantization direction contains also 
the first order derivative of the wave functions. The derivative can be removed from 
the equation by a wave function transformation [163], which results in a standard 
Schrodinger-like equation for the envelope function in the quantization direction, 
allowing the determination of quantization mass, conductivity mass and density- 
of-state effective mass. For (110) orientation the subband structure consists of a 
fourfold and a twofold degenerate subband ladder. Subbands in both ladders are 
anisotropic. Simulation results for 20 nm thick SG and DG structures are shown 
in Fig. 12.12. The results for SG and DG structures, which are plotted as a function 
of the concentration per inversion layer, are in good agreement with each other 
and with experimental data [180]. Due to the high anisotropy of the subbands 
of each ladder the surface mobility is anisotropic. Diagonal values of the mobil- 
ity tensor along (001) and (110) directions are in agreement with those observed 
experimentally [180]. 
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Fig. 12.12 Low field mobility in different directions calculated for 20rmi (110) SG and DG 
structures {filled symbols) closely follows experimental data [180] (open symbols). Mobility tensor 
components per inversion layer in thick DG and SG structures are the same 
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Fig. 12.13 Mobility in ( 1 10) direction for SG and DG structures with (110) thin body orientation. 
Mobility is plotted as a function of concentration for SG structures and as function of half the 
concentration for DG structures. DG (110) UTB mobility is higher than the SG mobility in whole 
range of concentrations, in good agreement with the experimental data [180], supporting the bulk 
inversion concept [9] 



Figure 12.13 shows results of mobility simulations in (110) direction performed 
for several Si body thicknesses using the same scattering parameters as for thick 
Si films. The mobility of (1 10) UTB FETs coincides reasonably well with the exper- 
imental data [180] shown in Fig. 12.13. The mobility for (110) DG structures is 
higher than the mobility of (1 10) SG structures in the whole concentration range. It 
is consistent with recent experimental studies [156, 180] and confirms the concept 
of the mobility enhancement in DG UTB FETs due to volume inversion [9]. 

Mobility in a (100) UTB double-gate FET plotted as function of carrier con- 
centration per inversion layer becomes lower than the mobility in a corresponding 
SG FET. This is in sharp contrast with results for (110) FETs mobilities and is in 
obvious contradiction with the concept of the mobility enhancement due to volume 
inversion. In order to explain the apparent contradiction, high occupation of primed 
subbands in (100) DG FETs with a significantly lower mobility caused by heavier 
conductivity mass was assumed in [ 1 80] . The higher occupation of primed subbands 
in DG FETs was justified by smaller energy splitting between the lowest primed and 
unprimed subbands in a (100) DG FET induced by the interaction of the inversion 
layers [180]. However, in UTB FETs the subband structure is mainly determined 
by strong geometrical confinement. Therefore, the subband structure and the occu- 
pation of subbands should be quite similar in UTB DG and SG FETs provided 
the total carrier concentration is the same in both structures. At the same time the 
SG and DG mobilities should be compared for the same gate voltages. In case of 
an intrinsic body this corresponds to the mobilities compared as functions of the 
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concentration per single inversion layer. Since in a DG FET there are two inversion 
layers created at the front and the back interface of the Si body, the total concentra- 
tion in a DG structure is twice as high as in a SG structure for the same gate voltage. 
Also due to the double carrier concentration the occupation of the primed subbands 
can be higher in DG structures. 

In order to compare relative contributions of two different mechanisms to increase 
of the primed subband occupation in a DG FET with a SG FET, in Fig. 12.14 we 
plot the occupation of the primed subband ladder as a function of the total con- 
centration in SG and DG structures. The contribution A2 computed at the same 
total concentration is due to the difference in energy splitting between primed and 
unprimed ladders and corresponds to the mechanism proposed in [180]. This con- 
tribution plays a minor role compared to the increase A 1 of the primed subband 
occupation due to the doubled carrier concentration in DG structures for similar 
gate voltages. 

If we now apply similar argumentation to (110) UTB FETs, we find that the 
occupation of primed subbands increases rapidly in DG (110) UTB FETs at high 
effective fields, which should also lead to a more rapid decrease of DG mobility in 
(1 10) FETs due to the lower mobility in primed subbands. However, this contradicts 
the experimental data and the results of simulations. Higher occupancy of primed 
subbands in DG structures is present in both (100) and (1 10) UTB FETs and cannot 
be the only reason for the mobility lowering observed in DG (100) structures. 
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Fig. 12.14 Occupation of primed ladder in a 3 nm tliick (100) UTB FET as a function of total 
concentration for eitlier SG or DG operation mode. Higher primed ladder occupation in DG FET 
is mainly due to the Ai contribution resulting from twice-as-high carrier concentration in DG 
as compared to SG FETs, for similar gate voltage. Contribution A^ due to subband structure 
difference [180] is small 
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Due to different quantization masses, the subband structures in (100) and (110) 
UTB FETs differ significantly. In (100) UTB FETs the subband energy of the second 
unprimed subband is nearly equal to the energy of the first primed subband, while 
in (110) FETs the second unprimed fourfold degenerate subband lies significantly 
higher than the lowest primed subband. Even though the occupation of primed sub- 
bands increases for both (100) and (110) orientations with increasing effective field, 
higher unprimed subbands remain virtually de-populated in (110) UTB FETs. Con- 
trary, the occupation of higher unprimed subbands increases rapidly in DG (100) 
FETs for high effective fields. Figure 12.15 illustrates the higher unprimed subband 
occupation in (100) compared to (1 10) DG FETs. 

Figure 12.16 shows a strong correlation between the rapid mobility degradation 
and the occupation increase of the higher unprimed subbands in a (100) DG struc- 
ture. However, estimations demonstrate that the fraction of carriers occupying the 
higher subbands is not sufficient to cause such a significant drop in the DG (100) 
FET mobility. A more substantial correction comes from the mobility lowering in 
the ground subband itself. Since the higher unprimed subband is occupied in (100) 
DG structures, an extra channel of scattering between the ground and the second 
unprimed subbands opens. This mechanism plays a significant role in the (100) DG 
mobility lowering: If we switch off the inter-subband scattering in the simulations, 
thus forbidding the scattering between the lowest and higher unprimed subbands, 
the DG mobility exceeds the corresponding (100) SG mobility, even at high carrier 
concentrations, in compliance with the volume inversion concept. 

Degeneracy effects therefore play a significant role in compensating the volume 
inversion induced mobility enhancement in (100) DG structures. They lead to a 
significant occupation of higher subbands in the unprimed ladder, which results in 
increased intersubband scattering and mobility lowering reported in [156]. 
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Fig. 12.15 Illustration of higher unprimed subband occupation in (100) and (110) UTB FETs. 
Contrary to (110) FETs, the higher unprimed subbands are occupied significantly in DG (100) 
FETs, for the same concentrations, due to a heavier quantization mass and a lighter in-plane 
density-of-states mass 
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Fig. 12.16 Con'elation between mobility degradation and occupation of higher unprimed 
subbands in a 3 nm thick DG (100) SOI FET 



12.4.3 Electron Mobility Enhancement in FETs 
with Ultra-Thin Silicon Body 



The (001) substrate orientation intrinsically yields the largest effective mobility for 
electrons [71, 180]. Let us now apply uniaxial stress to a (001) oriented substrate. 
Two cases are considered: stress and channel both parallel to the [110] direction or 
to the [100] direction. In addition, the impact of the silicon body thickness on the 
mobility enhancement is investigated. 

In Fig. 12. 17a the mobility parallel and perpendicular to the [110] channel direc- 
tion [110] under 1 GPa tensile stress along [110] is compared to the unstrained 
mobility for two Si body thicknesses. At relatively large body thicknesses (t > 
20 nm) tensile stress along [110] direction has two beneficial effects on the [110] 
mobility. First, the splitting between the unprimed and primed subband ladders is 
increased, leading to higher occupation of the unprimed ladder with lower conduc- 
tivity effective mass. Second, the transport mass in stress direction is reduced with 
respect to the relaxed case, according to (9.10). These two effects determine the 
mobility enhancement at all inversion layer densities. 

Contrary, in the direction [110] perpendicular to stress, the effective mass is 
increased (9.9). This resulting to a smaller mobility enhancement in this direction 
at small inversion layer concentration, which further diminishes at larger inversion 
layer concentrations. 

In ultra-thin body FETs with the thickness (t < 5nm), the energy separation 
between the subband ladders is already high due to strong quantum confinement. 
Thus, the additional valley splitting due to stress have a negligible effect on the 
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Fig. 12.17 Simulated electron mobility for substrate orientation (001) of unstressed (solid lines) 
and 1 GPa stressed (dashed lines) silicon for two body thicknesses. The mobility components are 
plotted with filled symbols in the direction parallel to stress and with open symbols in the direction 
perpendicular to stress along (a) [110] and (b)[100] direction 



occupation of unprimed subbands and mobility. Therefore, larger (smaller) mobility 
parallel (perpendicular) to the [110] stress direction is because of the effective mass 
change (9.9,9.10) alone, in good agreement with experimental data [71, 182]. 

In Fig. 12.17b the effect of [110] uniaxial stress on [100] channel mobility is 
shown. Stress along [100] lifts the degeneracy of the fourfold (primed) ladder by 
pushing the subbands resulting from the [100] oriented valleys up in energy. Thus, 
the occupation of the primed subbands from the [ 1 00] ladder is reduced. Since these 
subbands have a larger transport mass in [100] direction, by depopulating them 
we enhance total mobility in [100] direction in thick films. Alternatively, mobility 
perpendicular to stress is reduced due to an increased occupation of [010] primed 
subbands with unfavorable mobility along [010] direction. 

In ultra-thin films, however, the occupation of primed subbands is negligible 
due to high energy separation between primed and unprimed subbands. There- 
fore, stress-induced shift between the [100] and [010] primed subbands does not 
result in subband repopulation. Since no effective mass change occurs the mobility 
enhancement vanishes at f = 2.4 nm. 



12.4.4 Stress-Induced Mobility and Drive 
Current Enhancement 



We now summarize the latest results on stress-induced transport enhancement 
obtained by different groups. It is predicted [165] that the highest enhancement 
factor for surface mobility of holes is achieved, when uniaxial compressive stress is 
applied along the [110] channel direction to the hole surface channel at the (001) sil- 
icon interface. Physics of mobility enhancement is determined by the strain-induced 
valence band warping [191] shown in Fig. 8.4 which results in the smaller conduc- 
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tivity mass and enhanced current in the transport direction. The enhancement factor 
rises almost linearly with stress up to 3 GPa, where it reaches the value close to four. 
At higher stresses a saturation in the hole mobility enhancement is reported [165]. 
For the (110) substrate orientation the enhancement factor is lower, however, at 
high stress the hole mobility value is found to be the same as for the case of (001) 
orientation for [110] stress/channel direction, independent on whether compressive 
stress/channel is in (1 10) or (1 1 1) direction [165]. These observations are confirmed 
in [97], where it is demonstrated that for the same [110] direction of the channel the 
response at high stresses is determined by the stress and transport orientation and not 
by the wafer orientation. By applying deterministic Boltzmann/multisubband k-p- 
based device solver [135] it is shown that values for the drive current enhancement 
achieved are comparable to that of the mobility enhancement, in the broad range of 
drive voltages. 

In case of electrons, the highest mobility enhancement factor due to tensile uni- 
axial stress is predicted for (1 10) substrate orientation, when stress is applied along 
the (110) channel direction [97, 98]. This behavior is consistent with that expected 
from the subband structure modification discussed in Sect. 1 1.8. It is also supported 
by recent mobility simulations in FinFETs where the highest mobility enhancement 
was reported for a longitudinally elongated (110) oriented FinFET [151], due to a 
substantial contribution in the total mobility of [1 10] oriented interfaces. 

It is shown [97] that, even at high level of stress close to 3 GPa mobility on (1 10) 
substrate remains inferior to [110]/(001) channel mobility of uniaxially elongated 
along the [110] direction. Therefore, the standard [110]/(001) channel orientation 
allows achieving superior electron mobility under practically relevant values of uni- 
axial stress [97, 98, 178, 183]. The reason for the highest mobility in this case is the 
depopulation of primed subband ladder with unfavorable conductivity mass aug- 
mented by the favorable strain-reduced transport mass reduction along the stress 
direction in the unprimed subband ladder. 

Recently, alternative materials with higher mobility like germanium [18, 88] and 
III-V semiconductors [142, 143] were considered as possible substitutions to silicon 
to boost the channel transport properties. It is interesting to note that the trans- 
port properties of these new channel materials can be significantly enhanced by 
stress [88, 121, 143] extending applicability of currently developed stress technolo- 
gies to several technology generations ahead. 



12.5 Quantum Transport IModels 

As device sizes become comparable to the electron wave length, quantum effects 
start playing a significant role. At the same time scattering with phonons, impurities, 
and surface roughness may still be strong and cannot be neglected. At this point a 
full quantum transport formalism which consistently describes quantum coherent 
propagation and dissipative scattering processes, must be employed. 
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Established techniques used to address dissipative quantum transport can be clas- 
sified according to the functions they are based upon: the nonequilibrium Green's 
functions, the density matrix, and the Wigner function. All three approaches are 
based on fundamental equations of motion and are equivalent at the most general 
level of formal description of a dissipative quantum system. The resulting system of 
integral-differential equations for the Green's function G(ri , ra, fi , ?2), or the den- 
sity matrix p(ri,r2,ri,r2), or the Wigner function /(r, p, o),?) is in many cases 
too complex to allow for a direct numerical solution. Each function depends on two 
vector and two scalar arguments. For a numerical solution, each argument of the 
function must be discretized. In the case of a three-dimensional system {d = 3) 
the total number of unknowns is A^tot = (Nx • Ny • N, ■ Nt)^ . Assuming 100 grid 
points for each argument this results in the astronomical number A^tot = 10'^, which 
results in a prohibitively large memory requirements. Approximations and simpli- 
fications must be incorporated in order to make the problem numerically tractable. 
It is mainly these simplifying assumptions that make the differences between the 
different approaches to address quantum transport. 

Although there are evidences that transport in MOSFETs with lOnm channels 
may still be governed by scattering, it is instructive to begin with a purely coher- 
ent carrier propagation in order to highlight the importance of quantum-mechanical 
effects in transport direction in short channel devices. 



12.5.1 Ballistic Transport and Tunneling 

A MOSFET utilizing ballistic transport was first suggested and studied by Natori 
[123]. It consists of a thin Si film connected to two reservoirs. One or two gate 
electrodes are adjusted close to the side film interfaces. The gate electrodes are 
electrically separated from the Si film by a dielectric material. In order to reduce 
scattering in the channel the Si film is not doped. 

The occupation of left- or right-propagating transversal modes is determined by 
the chemical potential of the left and the right reservoir, respectively. In equilib- 
rium, the chemical potentials of both reservoirs are the same and the currents of 
the left- and right-propagating modes are equal in value and opposite in sign and 
thus compensate each other. The total current in equilibrium is zero. The source- 
drain voltage shifts the chemical potentials and creates a unbalance in occupation 
between the left- and right-propagating modes. The number of propagating modes 
is controlled by the potential barrier in the silicon film, which is determined by 
the gate voltage. This picture is valid as long as there is no (back)scattering of the 
propagating modes in the silicon channel. We note that, except for the filling of the 
propagating modes in accordance with the reservoirs' Fermi-Dirac distribution, we 
did not need quantum mechanics to describe the ballistic transport [124]. 

For ballistic transport the channel must be shorter than the mean free path. In Si at 
room temperature the mean free path is a few nanometers. With the silicon channel 
scaled down, the potential barrier profile V(x) along the channel becomes steeper 
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and close to its maximum at x = xq it can be approximated by a parabolic disper- 
sion V(x) = Vm — miwix — Xo)^ / 2. The characteristic curvature of the potential m 
increases with the channel length Lc decreased approximately as [106] 

Moc—^. (12.60) 

For large curvatures, quantum-mechanical tunneling of electrons under the barrier 
becomes possible. The total current through the channel is determined by the sum of 
currents from all propagating modes and of the contribution of the modes tunneling 
under the barrier. 

In order to estimate the curvature cd beyond which transport is becoming deter- 
mined by quantum-mechanical tunneling, we use the Kemble formula for the 
transmission coefficient close to the maximum of the potential (12.60), follow- 
ing [106]: 



T(E) 



1 -I- exp 



(-^): 



Electrons are injected according to the Fermi-Dirac distribution 



ME) = 



1 -I- exp I 



kuT 



(12.61) 



(12.62) 



The current is determined by carriers with E > Vm flying above the potential 
maximum, if hw/lit < T . In this case tunneling can be neglected and transport 
is computed semi-classically. At the opposite limit fica/ln > ksT, the current 
must be computed quantum-mechanically, because the main contribution is due to 
carriers with E < Vm tunneling under the barrier. For typical parameters of Si 
and Vf„ = 50 meV the channel length beyond which tunneling under the barrier 
becomes important is around 10 nm. 

Due to the absence of scattering in the channel and excellent channel control, 
the double-gate ballistic MOSFET can be considered as an ultimate MOSFET. In 
order to describe transport accurately, we assume (100) orientation of the Si chan- 
nel at the Si/Si02 interface. As already discussed, a self-consistent solution of the 
two- or three-dimensional Schrodinger equation together with the Poisson equation 
represents a significant computational challenge, cf. [104]. Because of the strong 
lateral confinement in z direction due to the channel thickness t = 2 nm, only the 
first subband with the heavy mass m/, is populated with carriers which participate 
in transport for moderately high drain-source voltages. The wave function can then 
be written in the following approximate form: 

■I 2 TIT 

W(x,y,z) = iA(x)e'*M-cos(— ). (12.63) 
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This approximation simplifies the calculations and reduces the computational effort 
significantly [20, 66, 166, 192]. The effective Schrodinger equation for the function 
if{x) along the channel has the usual form Hilr = E-f, with the Hamiltonian H 

t 
H = --—if(x)+ - / dzcos[ — )eV{x,z), (12.64) 



where the potential V(x,z) is found from the Poisson equation. The Schrodinger 
equation must be supplemented with the boundary conditions. To describe an open 
system, one uses the boundary conditions corresponding to plane waves propagating 
in electrodes far from the channel. The Schrodinger equation can be solved numer- 
ically [190] or using the WKB semi-classical approximation [170]. The solution of 
the Schrodinger equation with open boundary conditions can also be achieved by 
means of the quantum transmitting boundary method [49, 105]. Simulators account- 
ing for a full two-dimensional solution of the open-boundary Schrodinger equation 
have been reported and applied to lOnm double-gate MOSFETs [104, 147]. 

A Schrodinger-Poisson solver [188] with open boundary conditions can be used 
to find the wave functions self-consistently. As a result, the transmission func- 
tion through the channel for each propagating mode is found. The total current is 
obtained by summing up the contributions from all modes propagating in forward 
direction from source to drain and in backward direction from drain to source. In 
an UTB FET when only a single subband is occupied, the current as function of a 
source-drain voltage V is [123] 

■JlmT [ ( (Ef-E\ (Ep-E-eVW 

(12.65) 

where Ei/2{x) is the Fermi-Dirac integral of the order 1/2. The structure of (12.65) 
is quite transparent. The two terms in the brackets correspond to supply functions 
from the source and from the drain electrode. Their difference, multiplied by the 
transmission function T(E), gives the current through the device. 

Figure 12.18 shows output characteristics of a transistor with fov = 1.5 nm, 
t = 2nm, for several values of gate voltage [170]. Two sets of IVs are pre- 
sented corresponding to the gate lengths of 5nm and 10 nm. For lOnm channel 
length / V characteristics are very similar to the characteristics of the ideal tran- 
sistor, see Fig. 12.18a, with a perfect saturation, suggesting an almost vanishing 
Drain-Induced Barrier Lowering (DIBL) effect. The lOnm transistor is therefore 
very close to the ideal double-gate ballistic transistor, described in [123]. With the 
decreased gate length the current saturation becomes less pronounced. The reason 
for this degradation is twofold: tunneling under the barrier formed by the potential 
in the channel and drain-induced barrier lowering. The characteristics are accept- 
able for L = 5nm,seeFig. 12.18b, while for L = 2.5nm the saturation practically 
vanishes. The output characteristics simulated for an ultra-thin body double-gate 
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Fig. 12.18 Output characteristics of a DG FET with 2nm Si body and 1.5 nm silicon dioxide 
thickness, (a) Gate length lOnm and (b) gate length 5nm. With the gate length decreased, the 
current saturation is less pronounced 



MOSFET with a gate length L as short as 5 nm shows that even such a small device 
possesses an /on/^otf ratio sufficient for logic applications and displays a reason- 
able short-channel effect and acceptable DIBL, a conclusion recently reached from 
more detailed atomistic calculations [146]. It should be noted that the sensitivity to 
small MOSFET dimension variations, the control of doping as well as the whole 
manufacturing process represent significant challenges for multi-gate MOSFETs 
with a gate length below 10 nm. 

Using the ballistic transport model and neglecting tunneling under the barrier 
(length L > 10 nm), one can evaluate influence of strain-induced electron sub- 
band structure modification discussed in Chap. 1 1 on transport in ultra-thin body 
[110]/(001) MOSFETs. We are using stress-dependence of the subband energies 
and the effective masses evaluated numerically in Sect. 1 1.4.3. 

For the thick film (t = 10.9nm) the effective masses closely follow the ana- 
lytical expressions (9.9, 9.10) [169], obtained by neglecting the valley coupling. 
The valley coupling caused by strain and reduced thickness introduces a substan- 
tial difference in the effective masses of the two unprimed subbands with the same 
quantum number, which are usually treated as completely equivalent. Even more, 
for t = 2.7 nm a clear difference in masses is already observed in relaxed films 
visible in Fig. 11.16, which becomes high in an ultra-thin film with the thickness 
t = 1.4nm(Fig. 11.17). 

Examples of the IVs for t = 10.9 nm and t = 1.4nm at the gate voltage Vq = 
0.5 V are shown in Figs. 12.19 and 12.20, correspondingly. The enhancement of 
the drive current for all Vds with shear strain is observed. In order to characterize 
the enhancement quantitatively, we have evaluated the relative current increase as 
(I(£xy, V) — Io{V))/IoiV) for each source-drain voltage. Results for different film 
thicknesses are shown in Figs. 12.21-12.24. 
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Fig. 12.19 The IV characteristics of a double-gate ballistic MOSFET with a silicon body of 
10.9 nm thickness as function of shear strain at Va = 0.5 eV. Strain is increased from to 2% in 
steps of 0.1% 
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Fig. 12.20 The / V characteristics of a double-gate ballistic MOSFET with a silicon body of 
1.4 nm thickness as function of shear strain at Vq = 0.5 eV. Strain is increased from to 2% in 
steps of 0.1% 
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Fig. 12.21 Relative current enhancement (/(Ej,., V) — loiV))/ Io(V) as function of shear strain 
for a double-gate MOSFET with the body thickness t = 10.9 nm. Dotted lines are obtained using 
the analytical model (9.9), (9.10) for the effective masses 
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Fig. 12.22 Relative current enhancement (/(£vy. ^) ~ Io{y))/ hiV) as function of shear strain 
for a double-gate MOSFET with body thickness ( = 5.4 nm. As for t = 10.9 nm body thickness, 
the enhancement is larger in the linear regime 
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Fig. 12.23 Relative current enhancement (/(Sv,,, V) — Iq{V))/ Iq{V) as function of shear strain 
for a double-gate MOSFET with body thickness ( = 2.7 nm. Contrary to the MOSFETs with 
thicker bodies,the enhancement becomes larger at saturation 
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Fig. 12.24 Relative current enhancement {I{Eyy. V) — Io{V))/ Io(V) as function of shear strain 
for a double-gate MOSFET with body thickness ; = 1 .4 nm. Due to strain-induced subband split- 
ting shear strain is extremely efficient for the enhancement of the on-current in MOSFETs with 
ultra-thin bodies 
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For small shear strain values the current enhancement is bias independent for 
all studied film thicknesses. For extremely high strain the saturation of the current 
enhancement is observed. For intermediate strain values the enhancement behaves 
differently in thick and thin silicon films at high and low voltages. 

In thicker films the enhancement in the linear regime is larger than in saturation 
(Figs. 12.21 and 12.22). Figure 12.21 shows the current increase with the values 
from Fig. 11.14 for the effective masses and with the masses (9.9) and (9.10) in 
bulk silicon, which demonstrates that the enhancement in a f = 10.9 nm film is due 
to the effective mass modifications with strain. For small strain values the current 
enhancement is due to the rriy increase. However, for larger strain the density-of- 
states effective mass ^nxxmy starts increasing. For fixed gate voltage it lowers the 
chemical potential <^, thus reducing the current. 

At saturation only half of the states flowing from source to drain is filled. For a 
fixed gate voltage it results in a higher chemical potential </> compared to the linear 
current regime, where the difference in states filling flowing from source to drain and 
in opposite direction is small. The decrease of </> due to the strain-induced increase in 
the density-of-states is thus more pronounced in saturation leading to a substantially 
smaller current enhancement. 

In thin silicon films the behavior of the enhancement is reversed (Figs. 12.23 
and 12.24). This cannot be explained by the change in the subband effective masses 
alone. The main reason is the strain-induced energy splitting between the unprimed 
subbands with the same quantum number [169]. The splitting increases with strain 
and is particularly large in thin films. Because of this splitting the density of states 
decreases with strain prompting an increase in the chemical potential and current. 
In saturation the increase of (/> is larger than in the linear regime guaranteeing the 
large drive current enhancement. 

The methods described so far are either based on the assumption of semi-classical 
or pure quantum-mechanical ballistic transport. The former modeling approach has 
proven to be adequate to describe transport in previous generations of microelec- 
tronic devices. The latter one may be used for transport description, when the carrier 
coherence length is larger than the device size. Recent studies show that even for 
devices with a channel length as short as 15 nm scattering may still play a significant 
role [134]. An adequate transport model for ultra-scaled MOSFETs must therefore 
account for quantum-mechanical and dissipative effects simultaneously. 



12.5.2 Quantum Transport Models with Scattering 

Alternative approaches which can handle both quantum-mechanical and dissipa- 
tive scattering effects are based on the reduced density matrix [39, 40] and on the 
Wigner function [48, 89, 196]. The evolution for the reduced electron density matrix 
is described by a quantum Liouville equation. It is obtained from the equation of 
motion for the total density matrix by tracing out the degrees of freedom of the envi- 
ronment. To simplify the equation, the limit of weak interaction between the device 
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and the environment is applied. Memory effects can be neglected, if one is inter- 
ested in system evolution for times larger than the reservoir correlation time [109]. 
This coarse-graining in time results in a simplified, Markovian description, when 
the future dynamics of the system is determined by its current state and is indepen- 
dent from the past. The resulting quantum Liouville equation for the reduced density 
matrix is of Lindblad form and describes the quantum evolution of a system with 
loss [107]. The Lindblad form guarantees positive definiteness, normalization, and 
hermiticity of the reduced density matrix. 

Coupling to contacts introduces carrier exchange between device and reservoirs. 
The difference between the electrochemical potentials of the reservoirs causes cur- 
rent flow through the device. In the approach developed by Fischetti, based on 
the Pauli master equation, this coupling is introduced in a phenomenological man- 
ner [39,40]. Assuming a flat potential profile within each reservoir, electron wave 
functions are plane waves with well defined momentum. All electron states may be 
separated in two distinct groups, representing electrons traveling towards and away 
from the device, respectively. The role of each reservoir is to supply the incident 
wave functions with occupation determined by the equilibrium distribution within 
the reservoir. Therefore, the basis of choice to analyze the dynamics of the reduced 
density matrix is formed by the states incoming from each reservoir. These scatter- 
ing states are obtained by a numerical solution of the Schrodinger equation subject 
to open boundary conditions. Without dissipation in the system, i.e., when the evo- 
lution is governed by the system Hamiltonian, the density matrix is diagonal in the 
basis of scattering states. Occupation of each scattering state is determined by the 
equilibrium distribution within the reservoirs. Once the occupations of scattering 
states are known, the quantum ballistic current is readily determined. In the pres- 
ence of weak dissipation, when the device length is shorter than the phase coherence 
length, the state occupations are found from a solution of the Pauli master equa- 
tion, which contains only diagonal elements of the density matrix [39,40]. Carrier 
density and current density are calculated from the occupations of the scattering 
states. The Pauli master equation and the Poisson equation are solved repeatedly in 
a self-consistent iteration loop. In this way subband quantization and rapid poten- 
tial variations in the transport direction are taken into account self-consistently. The 
eigenfunctions are used to compute transition rates among the corresponding states. 
Application of the Pauli master equation is restricted to stationary systems, since in 
the non-stationary case current continuity would be violated [48]. 

An interesting solution free from the above mentioned shortcoming of phe- 
nomenological coupling of the device to the reservoirs was recently suggested by 
Gebauer and Car [50,51]. They suggest to impose periodic boundary conditions 
upon the non-perturbed system. A constant electric field is introduced into the 
system via the vector-potential term, which linearly depends on time. The mas- 
ter equation is solved in two steps. First, the Hamiltonian dynamics of the density 
matrix is evaluated for a certain time-step t. During this time evolution the vector 
potential grows linearly with time. It can be scaled out from the Hamiltonian by 
means of a guage transformation [72] at the expense of an additional phase factor in 
the wave function. The time-step at which the guage transformation is performed is 
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chosen such that the phase added to the wave function satisfies the periodic bound- 
ary condition. In the second step the change of the density matrix due to the Lindblad 
scattering operator is evaluated. This procedure is repeated until a steady state is 
achieved. Since the eigenfunctions of the periodic system do not carry any current, 
the current in the Gebauer and Car formulation is due to off-diagonal elements. 
This approach can also be used to describe transients. The only limitation is that the 
time scale is much larger than the reservoir coherence time, a condition arising from 
coarse graining. It has been shown that in addition to the usual Hamiltonian current 
component a dissipative current component due to interaction with the environment 
appears. The total current is then conserved also for transients. 

Another approach capable of handling both quantum coherent propagation and 
dissipative scattering effects is based on the Wigner function. The Wigner function 
is defined as the density matrix in a mixed coordinate/momentum representa- 
tion [89, 196]. A practically used approximation to incorporate realistic scattering 
processes into the Wigner equation is to utilize a properly adapted Boltzmann 
scattering operator [92]. In this way well established scattering models already 
calibrated within semi-classical transport approaches can be employed in quan- 
tum transport calculations. The inclusion of dissipation through the Boltzmann 
scattering operator, although intuitively appealing, raises some concerns about the 
validity of such a procedure. The Boltzmann scattering operator is semi-classical 
by its nature, and represents a good approximation for sufficiently smooth device 
potentials. 

To account for scattering more rigorously, spectral information has to be included 
into the Wigner function, resulting in an energy-dependence in addition to the 
momentum dependence [114]. An alternative approach to construct quantum- 
mechanical extensions of the semi-classical electron-phonon scattering operator 
has been reported in [13, 23]. These quantum collision operators satisfy a quantum 
H-theorem and relax systems towards quantum equilibria. 

The kinetic equation for the Wigner function is similar to the semi-classical 
Boltzmann equation, except for a non-local potential term. In the case of a slowly 
varying potential this non-local term reduces to the local classical force term, 
and the semi-classical description given by the Boltzmann equation is obtained 
from the Wigner equation. This semi-classical limit of the Wigner transport equa- 
tion allows to link seamlessly a semi-classical description of the extended contact 
regions with the quantum description of the active region of a device using the same 
formalism [92]. 

Reports on finite difference solutions of the one-particle Wigner equation for 
device applications were given by Ravaioli [144], Kluksdahl [87], and coworkers, 
and date back to the mid 1980s. Frensley [44-46] was the first who introduced 
boundary conditions on the Wigner function to model open quantum systems. 
Later, self-consistency was added to the Wigner equation solvers [47, 86]. Main 
and Haddad included a reduced Boltzmann scattering operator in transient Wigner 
function-based simulations [115]. Research on finite difference methods for the 
Wigner equation culminated in 1990 when review articles of Frensley [48] and Buot 
and Jensen [16] appeared. In 2002, implementations of Monte Carlo methods for 
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solving the Wigner device equation have been reported [125, 153]. With the finite 
difference method scattering was restricted to the relaxation time approximation and 
the momentum space to one dimension. 

The Monte Carlo method allows scattering processes to be included on a more 
detailed level, e.g., through the Boltzmann scattering operator, assuming a three- 
dimensional momentum-space [93, 154]. Realistic scattering processes can be easily 
embedded into the Wigner equation via Boltzmann-like scattering integrals, which 
turns out to be a good approximation. The Wigner function approach reduces to a 
semi-classical transport description in contacts providing an important advantage of 
a seamless treatment between classical and quantum-mechanical regions in device 
simulations [92]. 

The Wigner function is given by the density matrix in mixed representation [89, 
196] defined by the Wigner- Weyl transform 

/w(r,k,f) = p(r+ -,r--,fjexp(-/k-s)ds. 

The kinetic equation for the Wigner function is similar to the Boltzmann equation: 

(^ + V ■ V, Vw = y Kv(r, k' - k)/w(k', r, t)dk' + (^] . (12.66) 

The Wigner potential entering into the non-local operator in the right-hand side is 
defined as 

In case of slowly varying potentials the difference term in the right-hand side of 
(12.67) can be developed into the series. Keeping the first non- vanishing terms, one 
rewrites (12.67) as 



1 /-J /* 

Kv(r, k' - k) = -^^^ VrK (r) — j exp (-/(k' - k) • s) ds. (12. 



68) 



After substituting (12.68) into (12.66) the term with the non-local potential reduces 
to the classical force term. Following [53], one can introduce a spectral decomposi- 
tion of the potential profile V{x) into a slowly varying, classical component and a 
rapidly changing component treated quantum-mechanically. 

K(r) = Ki(r) + K„„(r). (12.69) 

This separation of the total potential into a smooth classical and a small quantum- 
mechanical contribution can improve the stability of a numerical solution method. 
The quantum-mechanical contribution may be moved into the right-hand side of the 
transport equation. Considered as a perturbation, the quantum-mechanical term can 
be interpreted as a quantum scattering integral. It allows to treat quantum effects on 
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equal footing with classical scattering: 

il^' ■ ^r-"^^ ■ V,) A = Jv.Ur, k' - k)/.(k', r, Odk' + (1^)^^^ . 

(12.70) 

The Wigner function formalism treats scattering and quantum-mechanical effects 
on equal footing through the corresponding scattering integrals [ 1 26] . By analogy to 
the Monte Carlo methods used for the Boltzmann transport equation, it is tempting 
to solve the quantum Wigner transport equation (12.70) by means of the Monte 
Carlo technique. Such a program was recently realized in [91, 96, 140, 141, 155, 
166]. However, since the kernel of the quantum scattering operator is not positively 
defined, the numerical weight of the particle trajectory increases rapidly, and the 
numerical stability of the trajectory-based Monte Carlo algorithm becomes a critical 
issue. A multiple trajectories method was suggested [92] to overcome this difficulty. 
In the algorithm developed the problem of a growing statistical weight of a single 
trajectory is addressed by creating an increasing number of trajectories with constant 
weights, which may assume positive and negative values. Being formally equivalent 
to the former method, the algorithm allows the annihilation of particles with similar 
statistical properties, introducing a possibility to control the number of trajectories. 

This method was recently applied to double-gate MOSFETs [166] and quantum 
wires [127]. In the coherent mode, where scattering is turned off, a comparison to 
conventional Schrodinger solvers can be performed. In order to estimate the tunnel- 
ing component of the current the Wigner Monte Carlo simulations were carried out 
for a MOSFET with a gate length of 10 nm, and a good agreement between the two 
approaches was found. 

The carrier concentration must be used to update the potential in the device by 
solving the Poisson equation. A superimposed iteration loop makes the Wigner- 
Poisson solver self-consistent. An example of self-consistent potentials for n-i-n 
Si structures with an intrinsic region of length W ranging from 20 nm to 2.5 nm, 
as calculated with the Wigner Monte Carlo method and the classical Monte Carlo 
method is shown in Fig. 12.25. The doping profile is assumed to increase gradually 
from the intrinsic channel to the highly doped contacts over the same distance W. 
Phonon and Coulomb scattering were included. As expected, for long W the clas- 
sical and quantum calculations yield similar results for the self-consistent potential. 
For W = 2.5 nm an extra space charge due to electrons tunneling under the bar- 
rier becomes important, which results in a potential barrier increase. Despite of 
the potential barrier increase, the current in self-consistent Wigner simulations is 
approximately 20% higher compared to its classical value found by a self-consistent 
solution of the Boltzmann and the Poisson equations. 

Relative differences between /wig and the current /ball computed for a "bal- 
listic" device with scattering inside the intrinsic and transition regions turned off 
is shown in Fig. 12.26. For W = 2.5 nm the relative differences in current due to 
quantum effects and scattering in the barrier are still of the order of 25% and cannot 
be neglected. 
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Fig. 12.26 Relative difference between currents calculated with the Wigner and Boltzmann 
Monte Carlo methods (diamonds) and calculated with the Wigner Monte Carlo method for n-i-n 
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The Wigner Monte Carlo method gives accurate results not only for single-barrier 
devices, but it can also be applied to purely quantum-mechanical systems such as 
resonant tunneling diodes [92]. A typical output characteristic of a GaAs resonant 
tunneling diode is shown in Fig. 12.27. Scattering with polar optical phonons as well 
as Coulomb scattering in the contacts is considered. A region of negative differen- 
tial resistance common to transport via a resonant level is clearly visible after the 
resonance peak at 250 mV applied voltage. A self-consistent solution of the Wigner 
transport and Poisson equation is mandatory for the correct determination of the 
resonance position due to charge accumulation at the cathode side of the resonant 
tunneling diode. Before the barrier, an accumulation layer forms, depending on the 
applied voltage, as seen in Fig. 12.28. This results in a voltage shift of the resonance 
peak of the /y characteristics shown in Fig. 12.27. Atypical distribution of the con- 
centration in resonance condition and off -resonance is presented in Fig. 12.28. The 
amount of charge localized in the potential well is much higher at resonance. This 
leads to a potential barrier increase and also contributes to the shift of the resonance 
peak. This example demonstrates the importance of quantum-mechanical effects for 
the simulation of properties of ultra-scaled devices. It also shows that space charge 
effects are of crucial importance for the accurate prediction of output characteristics 
of single- and double-barrier devices. 

Generalization of the Wigner equation-based transport models to include strain is 
pending because of additional complications due to inclusion of the band structure 
in the formalism. The Green's function method briefly discussed below allows to 
investigate stress dependent transport properties provided that the parameters of the 
tight-binding model which describes the band structure are properly calibrated to 
include strain. 



12.5.3 Non-Equilibrium Green's Function Method 

The Green's function method provides a powerful technique to evaluate the proper- 
ties of a many-body system both in thermodynamic equilibrium and non-equilibrium 
situations. The single-particle Green's function of a system allows evaluation of 
carrier density and current. The many-particle information about interactions is 
cast into self-energies. The perturbative solution of the equations for the Green's 
functions and self-energies is the key technique to evaluate the properties of an 
interacting system in external fields. 

Four types of Green's functions can be defined as the non-equilibrium statistical 
ensemble averages of the single particle correlation operator [1 14]: 

G>(r,?;r',f') = -ih-^f{r,t)fHr',t')), 

\Y \ , lYV , Jh (12.71) 

G'^ir, t; r', t') = 9(1 - r')[G>(r, f ; r', t') - G<(r, f ; r', t% 

G^{r, t\ r', t') = 6(t' - r)[G<(r, t\ r', t') - G>(r, f ; r', t')]. 
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Fig. 12.27 Typical IV curve of a resonant tunneling diode, calculated self-consistently (solid line), 
contrasted against a non-self-consistent characteristics. Charge accumulation before the potential 
barrier as well as a higher charge density inside the RTD between the potential barriers are respon- 
sible for the significant shift of the IV resonance peak, thus demonstrating the importance of space - 
charge effects 
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where 9{t) is the unit step function, and i/f^(r, t) and i/f(r, t) are the field operators 
creating or destroying a particle at point (r, ?) in space-time, respectively. Only three 
of these Green's functions are independent. 

Under steady-state conditions the equation of motion for the Green's functions 
can be written as [21]: 

[E - H{r)]G^ir, r': E) - f driS^{r, rj; E)G^iri, r'; E) = S(r - r'), (12.72) 
G^(r, r'; E) = f dri f drzG^ir, n; £')i;^(ri, rj: E)[G^{r2, r'; E)]\ (12.73) 

where H is the one-particle Hamiltonian, and S^, S^ , and S^ are the retarded, 
lesser, and greater self-energies, respectively. The total self energy contains con- 
tributions due to the coupling of the device to the contacts and other interactions, 
such as electron-phonon interaction, 1^ = Sc + ^ei-ph- The self-energies due to the 
coupling of the device to the contacts are only non-zero at the boundaries [21] 

Sl(E) = Tgctl (12.74) 

E<(E) = -2i^sm[E^]fciE), (12.75) 

E^(E) = +2i^sm[S^]{l - ME)), (12.76) 

where r is the coupling matrix between the device and the contact, gc is the surface 
Green's function [21], and fc{E) is the Fermi-Dirac distribution function at the 
contact c. Within the self-consistent Born approximation, the self-energies for the 
electron-phonon interaction are 



G"=(r,r';£'±^«qj), (12.77) 

G''ir,r';E^hco^j), (12.78) 

^ei-ph(r, r : £■) = --Td-phCr, r-E) + P I 



In 



(12.79) 



fico^j denotes the phonon energy of branch j at the wave-vector q, n^j is the aver- 
age phonon occupation number, D(^j is the electron-phonon interaction strength, 
yei-ph = i (^eT-ph ~ ^eT-ph) defines the broadening, and P / represents the principal 
part of the integration. The imaginary part of the retarded self-energy broadens the 
density of states, whereas the real part shifts it. The plus and minus signs in (12.77) 
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and (12.78) denote the phonon emission and absorption processes, respectively. 
Assuming that the environment stays in thermal equilibrium, n^j is given by 

1 

(12.80) 



The self-energy due to electron-phonon interaction comprises the contributions 
of elastic and inelastic scattering mechanisms, -S'e'Lph = S^i + ^inei- ^y definition 
the particle energy is conserved for elastic scattering. For inelastic scattering, the 
after-scattering energy is different from its initial value. The energy difference is 
carried away by a scattering mediator, e.g., by a phonon. For optical phonons the 
frequency dependence Wq on the wave vector q is weak and it has a finite value wq 
as q ^- 0, therefore carrier scattering with optical phonons is inelastic. Inelastic 
interaction of carriers with optical phonons is usually considered local [102]. In this 
case the self-energies can be written as 

K^iE) = J2 And,; (nsihcDj) +\±\) G^'^'iE ± hcoj), (12.81) 

KiiiE) = E ^'-1'./ (nB(fia>j} + ^ ± ^) G^'^iE T ficoj) . (12.82) 

The electron-phonon interaction strength is given by 

Di„,uj = —^ , (12.83) 

where trie is the mass of a carbon atom. 

For acoustic phonons Wqj- = vjq is proportional to q. Therefore, the acoustic 
phonon energy at small q is negligible compared to the carrier energy, and carrier 
scattering with acoustic phonons is frequently considered elastic. The corresponding 
self-energies for acoustic phonon interaction are 

S^^'^iE) = DlG^'\E), (12.84) 

kBT\Mf\^ 

Ai,; = —. . (12.85) 

InnicVj 

The non-equilibrium Green's function method addresses the quantum transport 
problem in the most consistent and complete way, however, the method is com- 
putationally complex and, with a few exceptions [117, 118, 173], usually applied 
to one-dimensional problems [103] and for a restricted set of scattering mech- 
anisms [172] only. Scattering requires the knowledge of the corresponding self- 
energies and thus complicates computations significantly [172]. The self-consistent 
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Born approximation for the self-energy is an extremely time consuming but nec- 
essary step, because it guarantees the current continuity. The convergence of the 
self-consistent iteration is a critical issue where fine resonances at some energies 
have to be resolved accurately [77, 136]. For that purpose an adaptive method for 
selecting the energy grid is essential [136]. 

The one-dimensional equation of motion for the Green's functions, (12.72) and 
(12.73), are solved for G*^'" and G^'" using a recursive Green's function algo- 
rithm [173]. Coupling to reservoirs [21, 171] is usually described by the contact 
self-energies (12.74). For numerical solutions it is convenient to transform the equa- 
tions for the Green's function in the eigen-mode representation [186]. Alternatively, 
a discretized version on a mesh in real space can be used [22]. This results in a tight- 
binding-like Hamiltonian defined on the mesh nodes. The carrier concentration at a 
node / of the spatial grid and the current density at the edge between the nodes / 
and / -I- 1 are given by 

— G^f{E), (12.86) 



lit "''' 



2q 



—29ie{G^jl^(E)t^^,j}, (12.87) 



where the factor 2 is due to the spin degeneracy. The equations for the Green's 
functions must be solved self-consistently with the Poisson equation [198]. 

Recently an atomistic and full-band quantum transport simulator designed for 
CMOS devices based on ultra-thin body silicon films and nanowires is being 
developed [112]. The simulator is able to resolve two- and three-dimensional 
Schrodinger-Poisson equations based on the sp^'d^s* semi-empirical tight-binding 
method. Carrier and current densities are obtained by injecting electrons and holes 
at different energies into the device and by solving the resulting system of equations 
in the non-equilibrium Green's function formalism. The sp^d^s* model was only 
recently calibrated [14] to include shear strain and results of current calculations in 
uniaxially stressed nanowires are expected to be available soon. 



12.5.4 Conclusion and Trends 

Transport modeling for TCAD applications has grown into a mature field of research, 
software development, and applications. Models of different complexity, precision 
and accuracy are offered and implemented in various commercial and academic 
TCAD tools. Starting with the drift-diffusion model and higher moments models, 
a complete hierarchy of transport models for semiconductor device simulation has 
been gradually constructed. Depending on parameter values and device scales, either 
semi-classical or quantum-mechanical transport description can be adopted. 
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Monte Carlo techniques are the basis for ultimate tools to obtain the solution 
of the Boltzmann transport equation with arbitrary scattering mechanisms. These 
methods require significant CPU resources and are relatively rarely used for indus- 
trial TCAD applications, when timely but perhaps less accurate results are of 
primary importance. Monte Carlo methods can easily be extended and generalized 
to incorporate strain-induced modifications in band structure and scattering rates. 
With quantum corrections carefully added, full-band Monte Carlo methods can pro- 
vide accurate results in strained ultra-scaled devices with strong size quantizations. 
Monte Carlo solvers are indispensable for calibration of the parameters of higher 
order moment models to include strain. 

With the channel size of MOSFETs in the decananometer region, the develop- 
ment of conceptually new future devices and architectures is becoming increas- 
ingly important. New nanoelectronic structures, utilizing carbon nanotubes, silicon 
nanowires, and molecules are considered to be prominent candidates for the post- 
CMOS era. Nanoelectronic devices are expected to complement and substitute some 
of the current CMOS functions. New physical principles for carrier propagations 
must be included in transport models for nanodevices. Indeed, at this small device 
size the geometrical spread of the carrier wave packet can no longer be ignored. 
Thus, the complete information about carrier dynamics inside the device including 
the phase of the wave function is needed. Although many particles with different 
energies and phases are injected into a device, the phases of an individual carrier 
may not average out completely, if the potential in transport direction is changing 
rapidly. The full quantum-mechanical description is needed, when the curvature of 
the potential close to its maximum is so big that tunneling under the potential barrier 
contributes significantly to the current. 

When the device channel length is smaller than the scattering length, current 
transport inside the device becomes ballistic. The scattering length is only sev- 
eral nanometers in modern MOSFETs operating at room temperature, therefore, 
even in ultra-scaled modern devices transport is not completely ballistic. If the 
potential changes rapidly enough on the scale of the carrier wave packet, a quantum- 
mechanical description of electron motion between two scattering events must 
be adopted. In the case when both the scattering length and the spread of the 
wave packet are comparable to the channel length, the interplay between coherent 
propagation and scattering determines transport. Methods for dissipative quan- 
tum transport are based on the non-equilibrium Green's function formalism, the 
Liouville/von-Neumann equation for the density matrix, and the kinetic equation for 
the Wigner function. A proper generalisation of the methods to incorporate stress 
is still pending. All the quantum methods for transport are time consuming, and 
highly accurate simulations of emerging nanoscale devices represent an outstanding 
modeling and computational challenge. Development of new efficient numerical 
algorithms as well as a comprehensive comparison between different quantum- 
mechanical models describing dissipative transport in open systems is mandatory 
and is currently on the research agenda. 

In modern microelectronic devices quantum effects are usually dominant in a 
small active region connected to relatively large, heavily doped contact areas, where 
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the carrier dynamics is essentially classical. Modern TCAD simulators must be able 
to incorporate both semi-classical and (dissipative) quantum-mechanical modeling 
approaches within the same simulation core. It is, therefore, necessary to have an 
accurate multiscale simulation technique, which is able to bridge the gap between 
semi-classical and quantum-mechanical simulations. 

With shrinking device dimensions, the demand for full three-dimensional accu- 
rate solvers for the coupled transport/Poisson equations [1 17, 179, 193] and atomistic 
based simulations [85, 112, 194] has grown significantly. With the advances in 
computer architectures, computational power and memory capabilities increasing, 
state-of-the art software, development of fast numerical algorithms and concep- 
tually new generic simulation platforms a fundamental breakthrough in speed, 
reliability, and accuracy of multiscale three-dimensional TCAD simulation tools is 
anticipated. This will facilitate an accurate design of the macroscopic transport prop- 
erties of emerging nanoelectronic and molecular devices based on their microscopic 
atomistic electronic characteristics. 
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